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Abstract 


The accelerating expansion of the Universe poses a major challenge to our understanding of 
fundamental physics. One promising avenue is to modify general relativity and obtain a new 
description of the gravitational force. Because gravitation dominates the other forces mostly 
on large scales, cosmological probes provide an ideal testing ground for theories of gravity. 
In this thesis, we describe two complementary approaches to the problem of testing gravity 
using cosmology. 

In the first part, we discuss the cosmological solutions of massive gravity and its gen¬ 
eralisation to a bimetric theory. These theories describe a graviton with a small mass, and 
can potentially explain the late-time acceleration in a technically-natural way. We describe 
these self-accelerating solutions and investigate the cosmological perturbations in depth, 
beginning with an investigation of their linear stability, followed by the construction of a 
method for solving these perturbations in the quasistatic limit. This allows the predictions 
of stable bimetric models to be compared to observations of structure formation. Next, we 
discuss prospects for theories in which matter “doubly couples” to both metrics, and examine 
the cosmological expansion history in both massive gravity and bigravity with a specific 
double coupling which is ghost-free at low energies. 

In the second and final part, we study the consequences of Lorentz violation during 
inflation. We consider Einstein-aether theory, in which a vector field spontaneously breaks 
Lorentz symmetry and couples nonminimally to the metric, and allow the vector to couple in 
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Abstract 


a general way to a scalar field. Specialising to infiation, we discuss the slow-roll solutions in 
background and at the perturbative level. The system exhibits a severe instability which places 
constraints on such a vector-scalar coupling to be at least five orders of magnitude stronger 
than suggested by other bounds. As a result, the contribution of Lorentz violation to the 
inflationary dynamics can only affect the cosmic microwave background by an unobservably 
small amount. 
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The effort to understand the universe is one of the very few things which lifts human 
life a little above the level of farce and gives it some of the grace of tragedy. 


Steven Weinberg, The First Three Minutes 




lam always surprised when a young man tells me he wants to work at cosmology; 

I think of cosmology as something that happens to one, not something one can choose. 


Sir William McCrea, Presidential Address, Royal Astronomical Society 



Introduction 


One of the driving aims of modern cosmology is to turn the Universe into a laboratory. By 
studying cosmic history at both early and late times, we have access to a range of energy scales 
far exceeding that which we can probe on Earth. It falls to us only to construct the experi¬ 
mental tools for gathering data and the theoretical tools for connecting them to fundamental 
physics. 

The most obvious application of this principle is to the study of gravitation. Gravity is by far 
the weakest of the fundamental forces, yet on sufficiently large distance scales it is essentially 
the only relevant player; we can understand the motion of the planets or the expansion of 
the Universe to impressive precision without knowing the details of the electromagnetic, 
strong, or weak nuclear forces. ^ As a result, we expect the history and fate of our Universe to 
be intimately intertwined with the correct description of gravity. For nearly a century, the 
consensus best theory has been Einstein’s remarkably simple and elegant theory of general 
relativity. This consensus is not without reason: practically all experiments and observations 
have lent increasing support to this theory, from classical weak-field observations such as the 
precession of Mercury’s perihelion and the bending of starlight around the Sun, to the loss of 

1 Modulo the fact that we need, as input, to know which matter gravitates, and that the quantum field theories 
describing these forces are essential to understanding precisely which matter we have. 
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orbital energy to gravitational waves in binary pulsar systems, observations remarkable both 
for their precision and for their origin in the strongest gravitational fields we have ever tested. 

Nevertheless, there are reasons to anticipate new gravitational physics beyond general 
relativity. In the ultraviolet (UV), i.e., at short distances and high energies, it is well known that 
general relativity is nonrenormalisable and hence cannot be extended to a quantum theory. It 
must be replaced at such scales by a UV-complete theory which possesses better quantum 
behaviour. The focus of this thesis is on the infrared (IR), i.e., long distances and low energies. 
While general relativity is a theoretically-consistent IR theory, the discovery in 1998 that the 
expansion of the Universe is accelerating presents a problem for gravitation at the longest 
distances [7, 8] . The simplest explanation mathematically for this acceleration is a cosmologi¬ 
cal constant, which is simply a number that we can introduce into general relativity without 
destroying any of its attractive classical features. However, from a quantum-mechanical point 
of view, the cosmological constant is highly unsatisfactory. The vacuum energy of matter 
is expected to gravitate, and it would mimic a cosmological constant; however, the value it 
would generate is as much as times larger than the value we infer from observations 
[9-11]. Therefore, the “bare” cosmological constant which appears as a free parameter in 
general relativity would need to somehow know about this vacuum energy, and cancel it out 
almost but not quite exactly. Such a miraculous cancellation has no known explanation. Alter¬ 
natively, one could imagine that the vacuum energy is somehow either rendered smaller than 
we expect, or does not gravitate—and theories which achieve this behaviour are known—but 
we would then most likely need a separate mechanism to explain what drives the current 
small but nonzero acceleration. 

For these reasons, it behooves us to consider the possibility that general relativity may not 
be the final description of gravity on large scales. To put the problem in historical context, we 
may consider the story of two planets: Uranus and Mercury. In the first half of the nineteenth 
century, astronomers had mapped out the orbit of Uranus, then the farthest-known planet, 
to heroic precision. They found anomalies in the observed orbit when compared to the 
predictions made by Newtonian gravity, then the best understanding of gravitation available. 
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Newton’s theory had not yet been tested at distances larger than the orhit of Uranus: it was, 
for all intents and purposes, the boundary of the known universe. A natural explanation was 
therefore that Newtonian gravity simply broke down at such unimaginably large distances, to 
be replaced by a different theory. In 1846, French astronomer Urbain Le Verrier put forth an 
alternative proposal: that there was a new planet beyond Uranus’ orbit, whose gravitational 
influence led to the observed discrepancies. Le Verrier predicted the location of this hitherto- 
unseen planet, and within weeks the planet Neptune was unveiled. 

Buoyed by his success, Le Verrier turned his sights to another planet whose orbit did 
not quite agree with Newtonian calculations: Mercury, the closest to the Sun. As is now 
famous, the perihelion of Mercury’s orbit processed at a slightly faster rate than was predicted. 
Le Verrier postulated another new planet, Vulcan, within Mercury’s orbit. However, the 
hypothesised planet was never found, and in the early parts of the twentieth century, Einstein 
demonstrated that general relativity accounted precisely for the perihelion precession. In the 
case of Mercury, it was a modification to the laws of gravity, rather than a new planet, which 
provided the solution. 

We find ourselves in a similar position today. Our best theory of gravity, general relativity, 
combined with the matter we believe is dominant, mostly cold dark matter, predict a decel¬ 
erating expansion, yet we observe something different. One possibility is that there is new 
matter we have not accounted for, such as a light, slowly-rolling scalar field. However, we 
must also consider that the theory of gravity we are using is itself in need of a tune-up. 

The project of modifying gravity leads immediately to two defining questions: what 
does a good theory of modified gravity look like, and how can we test such theories against 
general relativity? This thesis aims to address both questions, although any answers we find 
necessarily comprise only a small slice of a deep field of research. 

Einstein’s theory is a paragon of elegance. It is practically inevitable that this is lost when 
generalising to a larger theory. Indeed, it is not easy to even define elegance once we leave 
the cosy confines of Einstein gravity. Consider, as an example, two equivalent definitions of 
general relativity, each of which can be used to justify the claim that GR is the simplest possible 
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theory of gravity. First we can say that general relativity is the theory whose Lagrangian, 



( 1 . 1 ) 


is the simplest diffeomorphism-invariant Lagrangian that can be constructed out of the 
metric tensor and its derivatives. Alternatively, we could look at general relativity as being the 
unique Lorentz-invariant theory of a massless spin-2 field, or graviton [12-16]. 

These serve equally well to tell us why general relativity is so lovely, but they diverge once 
we move to more general theories. Consider, for example, modifying the Lagrangian (1.1) by 
promoting the Ricci scalar i? to a general function f{R], 




( 1 . 2 ) 


This is the defining feature of f[R] gravity, a popular theory of modified gravity [17-19]. One 
can certainly make the argument that this is mathematically one of the simplest possible 
generalisations of general relativity. However, when considered in terms of its fundamental 
degrees of freedom, we find a theory in which a spin-0 or scalar field interacts in a highly 
nonminimal way with the graviton. 

Alternatively, one can consider massive gravity, in which the massless graviton of general 
relativity is given a nonzero mass. While this has a simple interpretation in the particle picture, 
its mathematical construction is so nontrivial that over seven decades were required to finally 
find the right answer. The resulting action, given in equation (2.21), is certainly not something 
one would have thought to construct had it not been for the guiding particle picture. 

There are additional, more practical concerns when building a new theory of gravity. 
General relativity agrees beautifully with tests of gravity terrestrially and in the solar system, 
and it is not difficult for modified gravity to break that agreement. While this may be surprising 
if we are modifying general relativity with terms that should only be important at the largest 
distance scales, it is not difficult to see that this problem is fairly generic. Any extension of 
general relativity involves adding new degrees of freedom (even massive gravity has three extra 
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degrees of freedom), and in the absence of a symmetry forbidding such couplings, these will 
generally couple to matter, leading to gravitational-strength fifth forces. Such extra forces are 
highly constrained by solar-system experiments. Almost all viable theories of modified gravity 
therefore possess screening mechanisms, in which the fifth force is large cosmologically but is 
made unobservably small in dense environments. The details of these screening mechanisms 
are beyond the scope of this thesis, and we refer the reader to the reviews [20, 21]. 

In parallel with these concerns, we must ask how to experimentally distinguish modified 
gravity from general relativity. One approach is to use precision tests in the laboratory [22-29]. 
Another is to study the effect of modified gravity on astrophysical objects such as stars and 
galaxies [30-33]. In this thesis we will be concerned with cosmological probes of modified 
gravity. Because screening mechanisms force these modifications to hide locally (with some 
exceptions), it is natural to look to cosmology, where the new physics is most relevant. Cosmo¬ 
logical tests broadly fall into three categories: background, linear, and nonlinear. Background 
tests are typically geometrical in nature, and try to distinguish the expansion history of a new 
theory of gravity from the general relativistic prediction. Considering small perturbations 
around the background, we obtain predictions for structure formation at linear scales. Finally, 
on small scales where structure is sufficiently dense, nonlinear theory is required to make 
predictions, typically using N-body computer simulations. 

This thesis is concerned with the construction of theoretically-sensible modified gravity 
theories and their cosmological tests at the level of the background expansion and linear 
perturbations. In the first part, we focus on massive gravity and its extension to a bimetric 
theory, or massive bigravity, containing two dynamical metrics interacting with each other. 
In particular, we derive the cosmological perturbation equations for the case where matter 
couples to one of the metrics, and study the stability of linear perturbations by deriving a 
system of two coupled second-order evolution equations describing all perturbation growth 
and examining their eigenfrequencies. Doing this, we obtain conditions for the linear cos¬ 
mological stability of massive bigravity, and identify a particular bimetric model which is 
stable at all times. We next move on to the question of observability, constructing a general 
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framework for calculating structure formation in the quasistatic, subhorizon regime, and then 
applying this to the stable model. 

After this, we tackle the question of matter couplings in massive gravity and bigravity, 
investigating a pair of theories in which matter is coupled to both metrics. In the first, 
matter couples minimally to both metrics. We show that there is not a single effective metric 
describing the geometry that matter sees, and so there is a problem in defining observables. 
In the second theory, matter does couple to an effective metric. We first study it in the context 
of bigravity, deriving its cosmological background evolution equations, comparing some of 
the simplest models to data, and examining in depth some particularly interesting parameter 
choices. We next examine the cosmological implications of massive gravity with such a matter 
coupling. Massive gravity normally possesses a no-go theorem forbidding flat cosmological 
solutions, but coupling matter to both metrics has been shown to overcome this. We examine 
this theory in detail, finding several stumbling blocks to observationally testing the new 
massive cosmologies. 

The remainder of this thesis examines the question of Lorentz violation in the gravitational 
sector. We focus on Einstein-aether theory, a vector-tensor model which spontaneously 
breaks Lorentz invariance. We study the coupling between the vector field, or “aether,” and 
a scalar field driving a period of slow-roll inflation. We find that such a coupling can lead 
to instabilities which destroy homogeneity and isotropy during inflation. Demanding the 
absence of these instabilities places a constraint on the size of such a coupling so that it must 
be at least 5 orders of magnitude smaller than the previous best constraints. 

The thesis is organised as follows. In the rest of this chapter, we present background 
material, discussing the essential ingredients of general relativity and modern cosmology 
which will be important to understanding what follows. In chapter 2 we give a detailed 
description of the modified gravity theories discussed in this thesis, specifically massive 
gravity, massive bigravity, and Einstein-aether theory, focusing on their defining features emd 
their cosmological solutions. In chapters 3 and 4 we examine the cosmological perturbation 
theory of massive bigravity with matter coupled to one of the metrics. In chapter 3 we study 
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the stability of perturbations, identifying a particular bimetric model which is stable at all 
times, while in chapter 4 we turn to linear structure formation in the quasistatic limit and 
look for observational signatures of bigravity. In chapters 5 to 7 we examine generalisations of 
massive gravity and bigravity in which matter couples to both metrics. Chapter 5 focuses on 
the thorny problem of finding observables in one such theory. In chapter 6 we examine the 
background cosmologies of a doubly-coupled bimetric theory, and do the same for massive 
gravity in chapter 7. Finally, in chapter 8 we study the consequences of coupling a slowly- 
rolling inflaton to a gravitationeil vector field, or aether, deriving the strongest bounds to date 
on such a coupling. We conclude in chapter 9 with a summary of the problems we have 
addressed and the work discussed, as well as an outlook on the coming years for modified 
gravity. 


1.1 Conventions 

Throughout this thesis we will use a mostly-positive (--1—I—H) metric signature. We will de¬ 
note the flat-space or Minkowski metric by rj^y. Greek indices /i, v,... = (0,1,2,3) represent 
spacetime indices, while Latin indices = (1,2,3) are used for spatial indices. Latin 

indices starting from a,b,c,... are also used for field-space and local Lorentz indices. Partial 
derivatives are denoted by d and covariant derivatives by V. Commas and semicolons in 
indices will occasionally be used to represent partial and covariant derivatives, respectively, 
i.e., 4>^n = d^(j) and Symmetrisation and antisymmetrisation are denoted by 

S(//r) = ~ T Syjj^ , — 2 , (1.3) 

and similarly for higher-rank tensors. In lieu of the gravitational constant G we will frequently 
use the Planck mass, Mpj = I/SttG. Cosmic time is denoted by t and its Hubble rate is H, 
while we use t for conformal time with the Hubble rate For brevity we will sometimes use 
abbreviations for common terms, listed in table 1.1. 



10 


Chapter 1. Introduction 


Abbreviation 

Expression 

BAG 

baryon-acoustic oscillations 

CDM 

cold dark matter 

CMB 

cosmic microwave background 

ELRW 

Eriedmann-Lemaitre-Robertson-Walker 

GR 

general relativity 

SNe 

supernovae 

VEV 

vacuum expectation value 


Table 1.1: Abbreviations used throughout this thesis. 

1.2 General Relativity 

This thesis deals with modified gravity. Consequently it behoves us to briefiy overview the 
theory of gravity we will be modifying: Einstein’s general relativity. The theory is defined by 
the Einstein-Hilbert action, 

j d^x^R, (1.4) 

where R — is the Ricci scalar, with and the metric tensor and Ricci tensor, re¬ 

spectively. Allowing for general matter, represented symbolically by fields 4>,' with Lagrangians 
determined by particle physics, the total action of general relativity is 

S^SYti + ^ (1-5) 

Varying the action S with respect to gf^'’ we obtain the gravitational field equation, the Einstein 
equation, 

R,,v-^Rgiiv^8nGT^y, ( 1 . 6 ) 

where the stress-energy tensor of matter is defined by 

_ 2 5 


(1.7) 
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It is often convenient to define the Einstein tensor, 


1 



( 1 . 8 ) 


which is conserved as a consequence of the Bianchi identity, 


V^G^ = 0. 


(1.9) 


Note that we are raising and lowering indices with the metric tensor, g^y. The Bianchi identity 
is a geometric identity, i.e., it holds independently of the gravitational field equations. The 
stress-energy tensor is also conserved. 


V^^^ = 0. 


( 1 . 10 ) 


This is both required by particle physics and follows from the Einstein equation and the 
Bianchi identity, which is a good consistency check. A consequence of stress-energy conser¬ 
vation is that particles move on geodesics of the metric, g^y, 


+ - 0 - 


( 1 . 11 ) 


where x'^(A) is the position 4-vector of a test particle parametrised with respect to a parameter 
A, an overdot denotes the derivative with respect to A, and Kp = \g‘"'’igav,p + gpv.a - gap,v) 
are the Christoffel symbols. 

Einstein’s equation relates the curvature of spacetime to the distribution of matter. Ereely- 
falling particles then follow geodesics of the metric. The combination of the Einstein and 
geodesic equations leads to what we call the gravitational force. John Wheeler’s description of 
gravity’s nature is perhaps the most eloquent: “Spacetime tells matter how to move; matter 
tells spacetime how to curve’’ [34]. 


As discussed above, it seems clear to the eye that equation (1.4) is the simplest action 
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one can construct for the gravitational sector, if one restricts oneself to scalar curvature 
invariants. Indeed, the simplicity of general relativity can be phrased in two equivalent ways. 
Lovelock’s theorem states that Einstein’s equation is the only gravitational field equation 
which is constructed solely from the metric, is no more than second order in derivatives, is 
local, and is derived from an action [35] . Alternatively, as alluded to previously, the same field 
equations are the unique nonlinear equations of motion for a massless spin-2 particle [12- 
16]. There is but one extension to the gravitational action presented which neither violates 
Lovelock’s theorem nor introduces any extra degrees of freedom: a cosmological constant. A, 
which enters in the action as 



A represents an infrared, or low-energy, modification to general relativity, as it only be¬ 
comes important for small curvatures, R < 2A. Because the smallest spacetime curvatures 
are found at large distances and late times, the most important effect of a cosmological con¬ 
stant is, as the name suggests, on cosmology. As we will see in the next section, a positive 
cosmological constant has the predominant effect of causing the expansion of the Universe 
to accelerate at late times. The latest data suggest that, if a cosmological constant is respon¬ 
sible for the present cosmic speed-up, then it has an incredibly tiny value in Planck units, 
A/Mpj ~ [36]. Note that if there is a different explanation for the cosmic acceleration, 

such as dynamical dark energy or modified gravity, then A is typically assumed to be negligible. 
Indeed, since it is our aim in much of this thesis to explore modified gravity as an alternative 
to the cosmological constant, we will set A = 0 throughout, except when noted.^ 


^The question of why A should be zero, given that it generically receives large, ^?(Mpj) quantum corrections, 
is one of the greatest mysteries in modern physics, but is far beyond the scope of this thesis. Consequently we 
do not speculate about what mechanisms to remove the cosmological constant may be in play. 
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1.3 The Cosmological Standard Model 

Observations of the cosmic expansion history are, if we include a small cosmological constant, 
extremely well described [36] by general relativity with a Friedmann-Lemaitre-Robertson- 
Walker (FLRW) ansatz for the metric, 

d— —N[tf d+ a[tf f , (1-13) 

where we have used spherical polar coordinates, d^l — dO^ + sin^ 9dcf)^ is the metric on a 
2-sphere, and k parametrises the curvature of spatial sections. The FLRW metric is defined 
by two functions of time: the lapse, N[t), and the scale factor, a[t). It is a natural metric to 
use for cosmology as it is the most general metric consistent with spatial homogeneity and 
isotropy, i.e., the principle that there should be neither a preferred location nor a preferred 
direction in space. These principles are consistent with observations on scales larger than 
about 100 megaparsecs. 

In general relativity, we have the freedom to choose our coordinate system, and so the 
lapse can be freely “set” to a desired functional form /(f) by rescaling the time coordinate as 
df —> f{t)dt/N{t). Two common choices for the time coordinate are cosmic time, N{t) — 1, 
and conformal time, iV(f) = a(f). Cosmic time is more physical, as it corresponds to the 
time measured by observers comoving with the cosmic expansion (such as, for example, 
us). Conformal time is often computationally useful, especially since in those coordinates 
the metric is conformally related to Minkowski space if k = 0, = a[tYriny-, consequently, 
photons move on flat-space geodesics, and their motion in terms of conformal time can be 
computed without additionally calculating the cosmic expansion. It is worth keeping the 
lapse in mind because this thesis deals in large part with theories in which the time-time 
part of the metric (or metrics) cannot so freely be fixed to a desired value. As long as a single 
metric couples to matter, however (which is the case everywhere except in chapter 5), the time 
coordinate defined hy dz — N[t)dt, where N[t) is the lapse of the metric to which matter 
couples, will function as cosmic time when computing observables. 
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The dynamical variable determining the expansion of the Universe is the scale factor, a[t]. 
Its evolution is determined by the Einstein equation. At this point we specialise to cosmic 
time; the conformal-time equivalents of the equations we present can be easily derived by 
switching the time coordinate from t to t, where dz — d We use as the matter source 

a perfect fluid with the stress-energy tensor 

T^'’— {p + p) u'’+ p, (1.14) 


where is the fluid 4-velocity, p is the energy density, and p is the pressure. Then, taking 
the time-time component of the Einstein equation we obtain the Friedmann equation. 




SttG 


K A 
3 


(1.15) 


where the Elubble rate is defined by 



Conservation of the stress-energy tensor leads to the fluid continuity equation. 


(1.16) 


p -H 3U (p -H p) = 0. 


(1.17) 


Note that, in a Universe with multiple matter species which do not interact, this conservation 
equation holds both for the total density and pressure, as well as for the density and pressure 
of each individual component. The spatial part of the Einstein equation—or, equivalently, the 
trace—will lead to the acceleration equation. 


d , . A 

- = -47rG (p+3p) + -. 

d/ v5 


(1.18) 


This can also be derived from the Friedmann and continuity equations, hence its utility is 
limited for the purposes of this thesis. 


In order to close the system comprising the Friedmann and continuity equations, it is 
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typical to specify an equation of state relating the pressure to the density, p = pip), for each 
individual matter species. The simplest and most commonly-used equation of state is 

p — wp, (1.19) 

where u/ is a constant. Examples of perfect fluids obeying such an equation of state include 
pressureless matter, or “dust,” with w = 0, radiation, with w — 1/3, and vacuum energy or a 
cosmological constant, with w —- I. Indeed, we only need these three fluids to model the 
Universe back to about a second after the big bang, so we will focus on them. 

Let us briefly discuss some simple properties of the cosmological solutions to this system 
of equations. Because observations are consistent with a flat Universe, i.e., k — 0 [36], we 
will neglect the spatial curvature from here out. With w — const., the continuity equation is 
solved by 

p=Poa-='fi+'"’, (1.20) 

where po is a constant corresponding to the density when a — I (usually taken to be the 
present day). This leads to the following behaviours for the relevant cosmic fluids: 


p ~ 

radiation 

(1.21) 

p ~ a~^ 

dust 

(1.22) 

p — const. 

vacuum energy, cosmological constant. 

(1.23) 

Plugging these into the Friedmann equation, we obtain the following expansion rates during 

the various cosmic eras: 

a(t)~ 

radiation-dominated era 

(1.24) 

a(t)~ 

matter-dominated era 

(1.25) 

a(t)~ 

A-dominated era. 

(1.26) 
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In general, a universe dominated by a u/ = const. / -1 perfect fluid will evolve as a[t) ~ 

f2/3{l+w)^ 

Notice that, as time goes on, the densities of radiation and matter (and any fluid with 
w> -1) will decay, while that of a cosmological constant stays the same (which is sensible, 
since it has a constant contribution to the Friedmann equation). Therefore, if A > 0, there is 
necessarily a time after which the cosmological constant dominates the Friedmann equation, 
with H ~ const, and an exponential expansion. This makes quantitative the claim from above 
that a cosmological constant leads to late-time cosmic acceleration. Observations show that 
such a late-time acceleration is happening in our own Universe, and if it is caused by a perfect 
fluid then its equation of state is consistent with w — —l [36]. Following this, our criterion for 
self-acceleration in a theory of modified gravity will generally be that H tends to a constant at 
late times. 


Finally, we note that it is common to define a density parameter, U,, for each matter 
species. 


^ _8nGpi 
3U2 


(1.27) 


where the subscript i indexes each matter species. In peirticular, we will define Um, Ub, Uc 
and Ua for all matter (specifically dust), baryons, cold dark matter, radiation, and a putative 
dark energy, respectively. In terms of the density parameter, the Friedmann equation can be 
written in the simple and general form 


= ( 1 - 28 ) 
i 

as long as we define appropriate density parameters for the curvature and cosmological- 
constant terms. It is also common to parametrise the present-day density of each species in 
terms of the density parameter evaluated at the present, denoted by Broadly speaking, 

observations suggest Um.o ~ 0.3 and Ua,o ~ 0.7, while all other contributions are negligibly 
small or vanishing [36] . The fact that Ua,o is nonzero tells us that in order to match observations 
using general relativity, we need to introduce a “dark energy” component, of which the 
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^b,0^ 

0.022 


0.12 


^(10-5) 

Ha,o 

0.68 

Ho 

68.14 km/s/Mpc 


Table 1.2: The Planck best-fit cosmological parameters, taken from Ref. [36]. Here Hq — 
lOOfi km/s/Mpc. 


simplest example is a cosmological constant. The precise best-fit cosmological parameters 
from the Planck satellite are presented in table 1.2. 

We have progressively constructed the cosmological standard model, or A-cold dark mat¬ 
ter (ACDM). Its main ingredients are radiation (which is important mostly at early times), 
baryons and cold dark matter comprising pressureless dust, and a small cosmological con¬ 
stant. The gravitational theory is general relativity. It is the aim of this thesis to explore 
alternatives in which the cosmological constant is removed at the expense of introducing a 
different gravitational theory. 


1.4 Linear Perturbations around FLRW 

The FLRW metric was constructed to be consistent with spatial homogeneity and isotropy. 
The Universe is, of course, not really homogeneous and isotropic: in various places it contains 
stars, planets, galaxies, people, and Cambridge. The FLRW approximation holds on scales 
of hundreds of megaparsecs and higher, and breaks down at smaller distances. At slightly 
smaller distance scales, spacetime is well described by linear perturbations to FLRW. That is, 
taking to be an FLRW background metric, we consider 

gliv + 5gf,y, (1.29) 

where 5 g^y < 1 is a small perturbation, add a similar small piece to the matter sector, and 
calculate the Einstein equations for dg^v This proves to be a powerful tool for testing gravity: 
using probes of structure to test gravity at the linear level complements and can even be more 
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constraining than studies of the expansion history which operate at the background level. 

Let us be more explicit. We will work in conformal time (iV = a) and write the perturbed 
metric as 

gfj^vdx^dx'’ — + E)dT^ + 2diFdtdx' + [(l+yl)5/j dx'dx^]-. (1.30) 

We define the perturbed stress-energy tensor by 

r°o = -p(l + 5), ro^-[p + P)v\ 

T\^[p+P)iVi+diF), rj^[P+5p)5^j+rj, = (i.si) 

where v‘ = dx'/dt and barred quantities refer to background values. Let us specialise to 
pressureless dust {P — 5P — S'y = 0). 

Because of the coordinate independence of general relativity, not all of the perturbation 
variables represent genuine degrees of freedom: as we have things currently set up, it is 
possible for some of the perturbations to be nonzero while the spacetime is still purely FLRW, 
only written in funny coordinates. This could lead to unphysical modes propagating through 
the equations of motion. To remove this problem, we choose a coordinate system, or fix 
a gauge. We will work in conformal Newtonian gauge, in which F — B — 0. We will also 
decompose each variable into Fourier modes and suppress the mode index; the only effect 
of this for our purposes is that we can write 5‘Widj^ — where $ represents any of the 
perturbations. 

We are left with four perturbation variables. A, E, 5, and 8 = diV\ There is only one 
dynamical degree of freedom among these; any three of the variables can be written in 
terms of the fourth, which in turn obeys a second-order evolution equation. We will take 
this independent degree of freedom to be 5. By taking the off-diagonal part of the space- 
space Einstein equation, we can find that A — —E. The potential E is related to the density 
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perturbation, 5, by the time-time Einstein equation, 



(1.32) 


Most of the modes which we can access observationally are within the horizon, k > . To sim¬ 

plify the analysis we can focus on these modes by taking the subhorizon limits, 
and the quasistatic limit, ~ $ ~ 4>, where again $ represents any of the perturbation 

variables. In this limit, equation (1.32) takes the simple form 



(1.33) 


We can recognise this as the Poisson equation; the matter density contrast, 5, sources the 
gravitational potential, E, in a familiar way. 

We additionally have the v = 0 and v = i components of the stress-energy conservation 
equation. 



(1.34) 


(1.35) 


where 0 = diV\ In the subhorizon and quasistatic limit, these can be combined, along with 
the Poisson equation, to obtain a closed evolution equation for the density contrast,^ 



(1.36) 


The evolution equation for 5 can be integrated to obtain the growth rate of structure. 
While in general this requires numerical integration, as an illustration we can obtain exact 
solutions in general relativity during the various cosmic eras. During matter domination, we 

^Outside the quasistatic limit, this evolution equation will be sourced by E, which in turn obeys its own 
closed equation. Notice that there is still only one independent degree of freedom. 
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have p 3MpjJ^f2/a2, a ~ and M" = 2/t, so equation (1.36) becomes 


.. 2 . 6 

5 H—5- -5 — 0, 

T 


(1.37) 


which has, in addition to a decaying mode which we ignore, the growing solution 


5 ~ ~ a. 


(1.38) 


During dark energy domination, p (which is the density of matter) becomes negligibly small, 
and the only solutions to equation (1.36) are a decaying mode and 5 — const. We see that 
during the dark energy era, matter stops clustering. This makes intuitive sense: as the 
expansion of the Universe accelerates, it becomes more and more difficult for matter to 
gravitationally cluster “against” the expansion. 

A useful parametrisation for comparison to data is based on the growth rate. 


fia,k] = 


d\og5 
d\oga' 


(1.39) 


In the recent past, the growth rate of solutions to equation (1.36) is well approximated in 
terms of the matter density parameter defined above. 


f{a,k)^ni, (1.40) 

where the growth index y has the value y 0.545. The growth index typically deviates from 
this in theories of modified gravity. 


1.5 Inflation 

To this point we have discussed some of the essential ingredients of modern cosmology, par¬ 
ticularly general relativity and its application to background and linearised FLRW spacetimes. 
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We then used this to discuss the expansion history and the growth of structure in the “late 
Universe,” i.e., during the matter- and dark-energy-dominated eras. The standard cosmolog¬ 
ical model includes, at earlier times, two other important eras: radiation-domination and, 
before it, inflation. We will skip the radiation era, as it is not directly relevant to this thesis. 
This leaves inflation. 

The original motivations of inflationeiry cosmology were that it solves some of the glaring 
problems with a big-bang cosmology, which can be summarised as problems of initial con¬ 
ditions: a Universe that was always decelerating (until the recent dark-energy era) requires 
highly tuned initial conditions to be as flat and uniform as we see it. After inflation was 
initially developed, another significant motivation for inflation arose: quantum fluctuations 
during inflation are blown up to sizes Icirger than the cosmic horizon before they can average 
out, leaving a spectrum of perturbations which would seed the formation of cosmic structure, 
in excellent agreement with observations. For a comprehensive review of these motivations 
and inflationary physics, we point the reader to Ref. [37]. 

The simplest physical model for inflation, and the one with which we are concerned here, 
is single-field slow-roll inflation. In this model, inflation is driven by a canonical scalar field or 
inflaton, cj), with a potential V[cj)). The scalar has units of mass. We incorporate the scalar 
field by taking the gravitational action (1.5) with the matter Lagrangian given by 



(1.41) 


We will usually leave the dependence of the potential on cp implicit and simply write V. The 


Einstein equation (1.6) is sourced by the stress-energy tensor, defined in equation (1.7). For 
the scalar field action (1.41) this yields 



(1.42) 


Finally, by varying the action with respect to (p (or, equivalently, by calculating the Euler- 
Fagrange equation for Sd(j,) we obtain the equation of motion for the scalar field, or the 
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Klein-Gordon equation, 

(1.43) 

where □ = is the D’Alemhertian operator, and 1^ = dV/dcp. The Einstein and 

Klein-Gordon equations completely determine the behaviour of the two dynamical fields, g^y 
and (p. 

Now let us specialise to homogeneous and isotropic cosmology. As argued above, the 
metric must take the FLRW form. The scalar can only depend on time, (p [x^^] = ^ (t); if it were 
to depend on space, then it would break homogeneity or isotropy (or both) and communicate 
that violation to the metric, through the Einstein equation. With this metric, the stress-energy 
tensor can be written in the perfect-fluid form (1.14). Doing this we can identify the density 
and pressure of the scalar field. 


P = 


cP'^ 


2N^ 


V, 


A 

2N^ 


(1.44) 


The most essential condition for solving the big-bang initial condition problems and 
generating the observed cosmic structure is that the spacetime geometry during inflation 
be very close to de Sitter space, the vacuum solution of Einstein’s equations with a positive 
cosmological constant. This corresponds to an FLRW spacetime with a constant Hubble rate 
in cosmic time, H — const. Therefore in order to be a good driver of inflation, the inflaton, 
(p, must source the Friedmann equation in a way close to a cosmological constant.^ Recall 
from the previous section that a cosmological constant enters the Friedmann equation as 
a constant, while matter species contribute their density to this equation; therefore the 
condition for inflation is that the density of the scalar be nearly constant. As we have seen, 
this requires 



— V 

2N^ 

— + v 

2N'^ ^ 


- 1 . 


(1.45) 


'‘It cannot be exactly a cosmological constant as there would be no physical clock to distinguish one time 
from the next, and inflation could never end. This is why we need a scalar field with dynamics, rather than just a 
cosmological constant term. 
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It is clear that this condition is satisfied when <€. V. This is the slow-roll condition: the 

scalar field must be rolling very slowly, compared to the size of its potential. It is not difficult 
to see that this does what we want: if the scalar does not move quickly, then V{cj)) is very close 
to a constant, and the scalar field density will look very similar to a cosmological constant 
term. 


We can understand this condition in terms of the scalar field microphysics, i.e., as a 
condition on the potential, by utilising the Friedmann and Klein-Gordon equations. In the 
FLRW background these take the forms 






K 






d N \ . „ 


(1.46) 

(1.47) 


where H = d/{aN) is the cosmic-time Hubble parameter. At this point we will specialise, for 
simplicity, to cosmic time {N — \), although when relevant we will present results in terms 
of conformal time {N — a) as well. The expressions presented up to this point, keeping N 
general, will be necessary in chapter 7 when we consider a scalar field in a theory where we 
cannot freely rescale N. 


By taking a derivative of the Friedmann equation and removing terms using the Klein- 
Gordon equation, we obtain 






2MlP 


(1.48) 


This formalises the result we had derived less rigorously earlier: if (p is small, then H is nearly 
constant. But cp^ is dimensionful, so what do we mean by “small?” We have already argued 
that cp^ should be small compared to the potential, V. Moreover, in this slow-roll limit, the 
Friedmann equation becomes 


3W 




( 1 . 49 ) 
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Using this equation and the expression for H we can write the slow-roll condition as 


H 


s = - 


H2 


< 1 , 


(1.50) 


where s is the slow-roll parameter. 


It turns out not to be enough to demand that s be small: it must also be small for a 
sufficiently long period of time. If it were not, inflation would not last very long, and would be 
unable to solve the initial-conditions problems or produce cosmic structure over the range of 
scales that we observe. We formalise this condition by defining a second slow-roll parameter. 
Much the way that e is defined to demand that H be small,^ we define a new parameter, p, to 
measure the smallness of e, 




(1.51) 


Note that in conformal time, t, defining' = d/d t and using again for the conformal-time 
Hubble parameter = a'la, the slow-roll parameters are 


s’ 


(1.52) 


The full slow-roll limit of inflation can therefore be defined by demanding s,rj<.\. Observa¬ 
tions require that inflation last at least 50-60 e-folds, or ln(a//a,) >50-60, where a, and a / 
are the scale factors at the start and end of inflation, respectively. In the slow-roll limit, both e 
and rj are constant at first order and we can integrate to find, to first order in s, 


a — e 


fit 



2 


e + 0[e^] 


(1.53) 


H^H{l-Hts + 0is^)), 


(1.54) 


^We define e with a minus sign because, in order to satisfy the null-energy condition, we need H <0. 
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or, in conformal time, 



(1.55) 


T 


(1.56) 


where H is the Huhhle rate of the de Sitter background in the limit s —> 0. Note that in 


conformal time, t runs from -oo at the hig hang to 0 in the far future. In principle, “the far 
future” actually corresponds to the end of inflation, taken to be when f, r/ ~ 1 and the slow-roll 
expansion breaks down. This is assumed to be followed by a period of reheating, in which the 
scalar field decays into standard-model particles, and the radiation era thus commences. 


We now, finally, have the tools to understand the microphysics of a scalar field satisfying 
the slow-roll conditions. Using the expression (1.48) for H and the definition of the slow-roll 
parameters, we can find 



(1.57) 


Therefore, in addition to the aforementioned condition, (p^ <.V, we see that we must also 


demand (p < Hep. These correspond to f < 1 and ry < 1, respectively. The latter condition 
implies that the ^ term must be subdominant in the Klein-Gordon equation, so we can write 


its slow-roll version as 


3H(p 


(1.58) 


Using this expression and the slow-roll Friedmann equation, equation (1.49), we can write 
the slow-roll parameter as 



(1.59) 


Next, by taking the derivative of the slow-roll Klein-Gordon equation, and using the other 
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VM 



4>end reheating 


^ 0 


A0 


Figure 1.1: A prototypical potential for slow-roll inflation, taken from Ref. [37]. 


slow-roll relations as before, we obtain 



(1.60) 


We have already shown that ^/H(j) < 1, so r/v < 1 is implied by slow roll. 

Thus we can equivalently impose slow roll by demanding that the potential slow-roll 
parameters, and be small.® This tells us how to construct a good slow-roll potential: 
it should be very, very flat. Specifically, its first two derivatives with respect to (j) should 
be small in such a way that 1^ < V/Mpi and 1^^ < V/Ml^. A prototypical inflationary 
potential is shown in figure 1.1. Popular forms for the potential include V ^ cj)^, 1/ ~ 
and V^ cos(p/f. 

We conclude by mentioning that this formalism can be applied to the late-time accelerat¬ 
ing phase as well. This idea underlies quintessence models of dark energy. In quintessence, 
the cosmological constant is given dynamics by being promoted to a scalar field. This allows 
for w to vary from -1, and can also change the way that structure forms, because unlike 
a cosmological constant, quintessence can cluster. The mathematical formalism is almost 
exactly the same as that presented in this section. In the simplest models, the scalar field 
has the same Lagrangian, hence the same density and pressure, as we have used, and the 

^During slow roll, the potential slow-roll parameters are related to e and rjhy and r]yi^2e — 
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condition for acceleration is again that the potential dominate over the kinetic energy, now 
at late rather than early times. The main differences are that we need to include matter 
(specifically baryons and dark matter) in the matter Lagrangian as well, and that the potential 
is allowed to satisfy the slow-roll conditions forever, since while we know inflation ended, the 
same constraint does not apply to dark energy. 




Is this quintessence o[r] dust? 


Hamlet, Hamlet, 2.2 

2 

Gravity Beyond General Relativity 


At the core of this thesis is the question of modifying general relativity. In the previous chapter, 
we introduced general relativity and its cosmological solutions, culminating in a discussion 
of two key aspects of the cosmological standard model: ACDM at late times and inflation at 
early times. In this chapter, we extend that discussion to theories of gravity beyond general 
relativity, and in particular the theories which will receive our attention in this thesis: massive 
gravity, massive bigravity, and Einstein-aether theory. 

General relativity is the unique Lorentz-invariant theory of a massless spin-2 field [12-16]. 
To move beyond this theory, we must therefore modify its degrees of freedom. In massive 
gravity, this is done by endowing the graviton with a small but nonzero mass. Bigravity extends 
this by giving dynamics to a second tensor field which necessarily appears in the action for 
massive gravity; its dynamical degrees of freedom are two spin-2 fields, one massive and one 
massless. Finally, in Einstein-aether theory the massless graviton is supplemented by a vector 
field. This vector is constrained to always have a timelike vacuum expectation value (vev), 
and so spontaneously breaks Lorentz invariance by picking out a preferred time direction. It 
is thus a useful model for low-energy gravitational Lorentz violation. 
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2.1 Massive Gravity and Bigravity 

The history of massive gravity is an old one, dating hack to 1939 when the linear theory of 
Fierz and Pauli was published [38]. Studies of interacting spin-2 field theories also have a 
long history [39]. However, there had long been an obstacle to the construction of a fully 
nonlinccir theory of massive gravity in the form of the notorious Boulweire-Deser ghost [40], a 
pathological mode that propagates in massive theories at nonlinear order. This ghost mode 
was thought to be fatal to massive and interacting bimetric gravity until only a few years ago, 
when a way to avoid the ghost was discovered by utilising a very specific set of symmetric 
potential terms [41-48] . In this section we review that history, before moving onto the modern 
formulations of ghost-free massive gravity and bigravity, and elucidating their cosmological 
solutions. We refer the reader to Refs. [49, 50] for in-depth reviews on massive gravity and its 
history. 


2.1.1 Building the Massive Graviton 


The Fierz-Pauli Mass Term 


Let us begin by considering linearised gravity described by a spin-2 field, • We will routinely 
refer to this field as the graviton. The theory of a massless graviton is given by linearising the 
Einstein-Hilbert action around Minkowski space, i.e., by splitting the metric up as 


Sfiv — ^jiv T 


Mpi 


-h 


flVf 


( 2 . 1 ) 


where < 1, and keeping in the action only terms quadratic in Doing this, we 

obtain the Lagrangian of linearised general relativity. 


1 

-^GR,linear — 7 

4 




( 2 . 2 ) 
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where indices are raised and lowered with the Minkowski metric, rj^y, and we have defined 
the Lichnerowicz operator, S, by 



(2.3) 


where h = rj^'^hy.y is the trace of h^y. No other kinetic terms are consistent with locality, 
Lorentz invariance, and gauge invariance under linearised diffeomorphisms. 



(2.4) 


Indeed, this uniqueness is a necessary (though not sufficient) part of the aforementioned 
uniqueness of general relativity as the nonlinear theory of a massless spin-2 field. 

The role of gauge invariance is to ensure that there are no ghosts, i.e., no degrees of 
freedom with higher derivatives or wrong-sign kinetic terms. Ostrogradsky’s theorem tells us 
that, up to a technical condition, ^ a Lagrangian with higher than second derivatives will lead 
to a Hamiltonian which is unbounded from helow, and thus states with arbitrarily negative 
energy are allowed (for a thorough, modern derivation, see Ref. [51]). If we had included 
in equation (2.2) other terms that can be constructed out of h^y and its first and second 
derivatives, then the action would no longer be invariant under equation (2.4). In that case, 
we could split h^y into a transverse piece, and a vector field, as 


h/xy — h^^ -\-2d{pi/y'j. 


(2.5) 


and any terms not included in the action (2.2) would contain pieces with higher derivatives of 
Xu- Therefore we can see the linearised Einstein-Hilbert term as the kinetic term uniquely set 
by three requirements: locality, Lorentz invariance, and the absence of a ghost. 

We would like to give hf^y a mass—i.e., add a nonderivative interaction term—while 
maintaining those three requirements. Unfortunately, it is impossible to construct a local 


^Namely that the Lagrangian be nondegenerate, i.e., that dL/dq depend on q. 
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interaction term which is consistent with diffeomorphism invariance (2.4). Since this was use¬ 
ful in exorcising ghost modes, we will need to take care to ensure that no ghost is introduced 
hy the mass term. At second order, this is not especially difficult as there are only two possible 
terms we can consider: and h^. We can then consider a general quadratic mass term, 

- (1 - a)h^] . ( 2 . 6 ) 

This leads to a ghostlike, scalar degree of freedom with mass The only way 

to remove the ghost from this theory, besides setting m = 0, is to set a = 0. The ghost then 
has infinite mass and is rendered nondynamical. We see the unique ghost-free action for a 
massive graviton at quadratic order is the Fierz-Pauli action, 

= - h^) . (2.7) 


The Stiickelberg “Trick” 

Before moving on to higher orders in h^y, let us take a moment to count and classify the 
degrees of freedom it contains at linear order. Recall that a massless graviton contains two 
polarisations. Because we lose diffeomorphism invariance when we give the graviton a mass, 
it will contain more degrees of freedom. In fact, there are five in total. In principle, a sixth 
mode can arise, but it is always ghost-like and must therefore be removed from any healthy 
theory of massive gravity. To separate the degrees of freedom contained in the massive 
graviton, we use the Stiickelberg “trick.” 

Stiickelberg’s idea is based on the observation that a gauge freedom such as diffeomor¬ 
phism is not a physical property of a theory so much as a redundancy in description, and that 
redundancy can always be introduced by bringing in redundant variables. Let us consider 
splitting up hj^y as 

2 2 

hfiv ^ hfiy H 3{ii-^y) H '^djxdyfp. (2.8) 

Defining the field strength tensor for analogously to electromagnetism, F^v = | - as 
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well as n^v = dij,dv(j) and the trace notation [A] = the Fierz-Pauli action (2.7) becomes 

(n^v - [njry^v) - 

- ^ hrj^^) (2.9) 

This action is invariant under the simultaneous gauge transformations 

m 

V'(2.10) 


for and 




( 2 . 11 ) 


for A^. With these gauge invariances restored, one can find that h^v contains the usual two 
independent components of a spin-2 degree of freedom, A^ similarly contains the standard 
two independent components, and (j) contains one, leading to a total of2 + 2 + l = 5 degrees 
of freedom for a Fierz-Pauli massive graviton. 


Before moving on, let us briefly consider the limit m ^ 0. Intuitively we would expect this 
to reduce to general relativity. In this limit, the vector completely decouples from the other two 
fields, while the scalar remains mixed with the tensor. They can be unmtxed by transforming 
hfjiv —> + <pr}^y. However, this transformation introduces a coupling between (j) and the 
stress-energy tensor for matter (which, for simplicity, we have neglected so far) which does 
not vanish in the massless limit. This is the origin of the van Dam-Veltman-Zakharov (vDVZ) 
discontinuity [52, 53]. While linearised Fierz-Pauli theory with matter indeed does not reduce 
to general relativity in the limit m ^ 0, nonlinear effects cure this discontinuity: this is the 
celebrated Vainshtein mechanism which restores general relativity in environments where (f) 
is nonlinear and allows theories like massive gravity to agree with solar system tests of gravity 
[54]. For a modern introduction to the Vainshtein mechanism, see Ref. [55]. 
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The Boulware-Deser Ghost 


Upon moving beyond linear order, disaster strikes. While the Fierz-Pauli tuning (expressed 
above asa — O) removes a sixth, ghostlike degree of freedom from the lineeir theory, Boulware 
and Deser found that this mode generically reappears at higher orders [40], This is the 
notorious Boulware-Deser ghost. It is infinitely heavy on flat backgrounds, as evidenced by 
its infinite mass at the purely linear level, but can become light around nontrivial solutions 
[41], including cosmological backgrounds [56] and weak-field solutions around static matter 
[57-59]. 

A full history of this ghost mode is beyond the scope of this thesis; we will simply, following 
the review [49], introduce the simplest nonlinear extension of the Fierz-Pauli mass term 
and demonstrate the existence of a ghost mode, as an illustration of the fact that nontrivial 
interaction terms are required beyond the linear order in order to obtain a ghost-free theory 
of massive gravity. Let us define the matrix 

= ( 2 . 12 ) 

Linearising this matrix around as above, we find 

— (2.13) 

Mpi 

The Fierz-Pauli term (2.7) can be obtained by linearising [g^y — rj^y -t- h^y/Mpi) the nonlinear 
action [57] 

.^FP,nonlinear = -m2M2j/^{[(I-M)2] - [I-M]'}, (2.14) 

where I is the identity matrix in four dimensions. 

With a candidate nonlinear completion of massive gravity in hand, let us examine the 
behaviour of the helicity-0 mode, (j), ignoring the helicity-1 and helicity-2 modes for simplicity. 
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The matrix M becomes 




Mp\m 


+ 




rni^nn\ 


Plugging this into our nonlinear action, we find 


(2.15) 


-^FP,nonlinear = ([H^] - [Uf] + —^ {[U^] - [H] [H^]) + ([H^ - . ( 2 . 16 ) 

This clearly contains higher time derivatives for (p, and will therefore lead to an Ostrogradsky 
instability and hence to ghosts. In fact, even the first, quadratic term—the most obvious an¬ 
cestor of the Fierz-Pauli term—has higher derivatives. However, they turn out to be harmless 
for the reason that this quadratic term is, after integration by parts, a total derivative and 
therefore does not contribute to the dynamics. This miracle does not extend to any of the 
higher terms, which lead to a genuine sixth degree of freedom.^ 

The Boulware-Deser ghost is not specific to this particular, simple nonlinear completion 
of the Fierz-Pauli term. It is a very generic problem, to the point that as of a decade ago it 
was thought to plague all nonlinear massive gravity theories [58] . The ghost can, however, 
be removed: one can consider general higher-order extensions and, at each order, tune the 
coefficients to eliminate higher time derivatives by packaging them into total derivative terms 
[41]. This led to a ghost-free, fully nonlinear theory of massive gravity in Ref. [42], nearly four 
decades after the discovery of the Boulware-Deser ghost. 


To the Nonlinear Theory 


Taken on its own, linearised gravity in the form (2.2) is a perfectly acceptable theory without 
being seen as a truncation of a nonlineeir theory such as general relativity. Indeed, it is even a 
perfectly fine gauge theory, as its linearised diffeomorphism invariance is exact (as long as 
we include the Stiickelberg fields, if we take the graviton to be massive). The wrench in the 

^While we have seemingly split the metric up into five degrees of freedom—two tensor, two vector, and one 
scalar—and focused on the scalar, when (j) has higher time derivatives it in fact contains two degrees of freedom, 
one of which is generically a ghost. 
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works comes when we add matter in. Unfortunately, the coupling to matter is necessary, as 
we prefer our theories to communicate with the rest of the Universe and hence he subject to 
experiment. 

The coupling to matter at the linear level is of the form 




matter,linear — 


2M 


PI 


-h 


(2.17) 


where T^'' is the stress-energy tensor of our matter source. Diffeomorphism invariance is 
preserved in the matter sector if stress-energy is conserved, i.e., if — 0. However, the 

coupling to necessarily induces a violation of this conservation. For a simple example of 
this using a scalar field, see Ref. [49]. This problem is fixed by adding nonlinear corrections 
both to the matter coupling. 




matter,nonlinear ‘ 


^ h I ^ h h 


2M 


PI 


2M 


(2.18) 


PI 


for some tensor ^ and to the gauge symmetry, symbolically written as 


h^h + d^ + - d[h^). 

Mpi 


(2.19) 


While this ensures the conservation of the stress-energy tensor at the linear level, it is broken 
at the next order, and so we must continue this procedure order by order, ad infinitum. 

For a massless spin-2 field, the end result of this procedure is well-known: it is general 
relativity. The fully nonlinear gauge symmetry is diffeomorphism invariance, and as long as 
the matter action is invariant under this symmetry, the stress-energy tensor is covariantly 
conserved, = 0. The linear action (2.2) must be promoted to something which is also 

consistent with this symmetry, and there is one answer: the Einstein-Hilbert action (1.4). 

If we wish to extend this procedure to a massive graviton, i.e., nonlinearly complete the 
Fierz-Pauli mass term, then, as discussed in section 2.1.1, the Boulware-Deser ghost looms as 
a pitfall. Demanding that this ghost be absent will severely restrict the allowed potentials to 
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a very specific and special set of functions of M. However, the gauge invariance can still be 
restored quite easily by a nonlinear version of the Stiickelberg trick. Because this elucidates 
several of the properties of massive gravity, we will review it here. 

When constructing a nonlinear candidate theory of massive gravity in section 2.1.1, we 
employed the matrix M = g~^ri, where g“Ms the inverse of the dynamical metric and rj is the 
Minkowski metric. The appearance of a second, fixed metric in addition to the dynamical one 
is new to massive gravity. Indeed, it is necessary to have such a second metric, or reference 
metric, in order to give the graviton a mass. A mass term is a nonderivative interaction,^ and 
the only nonderivative scalars or scalar densities we can construct out of g^v alone are tr g = 4 
and det g. The first possibility is trivial, while it was shown in Ref. [40] that functions of the 
metric determinant can only consistently lead to a cosmological constant as well. Therefore 
we need a reference metric in order to construct a massive graviton.^ 

Note that, while we have so far taken the reference metric to be Minkowski, in principle we 
could extend the theory to a more general reference metric, Consequently, even once we 
have specified the interaction potential there are many different massive gravity theories, one 
for each reference metric. Alternatively, f^v can be viewed as a “constant tensor” which must 
be specified by hand. Physically, the reference metric corresponds to the background around 
which linear fluctuations acquire the Fierz-Pauli form [47, 65]. This is why we have naturally 
discussed theories with a Minkowski reference metric: we began by considering fluctuations 
around that metric, and so it remains when extending to the nonlinear theory. Note that 
ftiv = J7/XV is a natural choice, as the theory then possesses a Poincare-invariant preferred 
metric, allowing us to define mass and spin regardless of the solutions of the theory.® 

The nonlinear Stiickelberg trick is simply to introduce into the reference metric four 
Stiickelberg fields, as 

ftiv f^v = fabdi,(p“dy(p^. ( 2 . 20 ) 

^We could not have kinetic interactions anyway; in four dimensions, the Einstein-Hilhert term is unique [35]. 

^We have assumed locality in this discussion. A nonlocal theory can give the graviton a mass without the 
need for a reference metric [60-62]; see Refs. [63, 64] for studies of two interesting realisations of this idea. 

®We thank Claudia de Rham for emphasising this point. 
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Note that here Latin indices are in field space, not spacetime; in particular, each of the 
fields transforms as a spacetime scalar. Consequently, ff^v transforms as a tensor under 
general coordinate transformations, as well as and all of the nonlinear completions of 
the Fierz-Pauli term constructed out of it, as long as we replace f^v with /^v If we choose 
the coordinate axes to align with the Stiickelherg fields, — (p^, then we have = fjiv and 
recover the previous description. This is called unitary gauge. For simplicity we will usually 
work in unitary gauge when dealing with massive gravity. 

2.1.2 Ghost-Free Massive Gravity 

de Rham-Gabadadze-Tolley Massive Gravity 

Recent years have seen a breakthrough in massive gravity, stemming from the development of 
a fully ghost-free and nonlinear theory by de Rham, Gabadadze, and Tolley (dRGT) [41,42], fol¬ 
lowing the important preliminary steps taken in Refs. [66, 67] using auxiliary extra dimensions. 
The full proof that the dRGT theory is ghost-free, including for a general reference metric, was 
completed in Refs. [46-48] . There are indications that this theory is the unique ghost-free 
massive gravity; in particular, new kinetic interactions do not appear to be consistent [68, 69]. 
We will use the formulation of the dRGT interaction potential developed in Ref. [45] . 

The action for dRGT massive gravity around a general reference metric, is 



( 2 . 21 ) 


or, equivalently. 



®As discussed above, we are, stricdy speaking, writing the action for massive gravity in unitary gauge. Extend¬ 
ing to a more general gauge by including Stiickleberg fields by promoting /^v to fabdi_i4’‘‘dy4>^ is trivial. 
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where we have defined the square-root matrix X as 


X=Vg^, (2.23) 

i.e., defined X so that (reintroducing explicit spacetime indices) X/^aX"v = g^"fav, and the 
related matrix K hy 

K = I-X. (2.24) 

Here, a„ and cire dimensionless coupling constants, generally taken to he free peirameters, 
and are the elementary symmetric polynomials of the eigenvalues A; of the matrix argument. 
In terms of the eigenvalues, assuming i runs from 1 to 4, these are (taking as the argument a 
general matrix. A, for concreteness) 

eo(A) = 1, 

ei(A) = Ai -f A2 “f As -f A4, 

e2(A) = Ai A2 -f Ai A3 -|- A3 A4 -|- A2A3 -|- A2A4 -|- A3A4, 
e3(A) = Ai A2A3 -|- A3 A2A4 -|- A3 A3 A4 -|- A2 A3A4, 

e 4 (A) = A3A2A3A4 = det A. (2.25) 


It is often more useful to write these polynomials directly in terms of the matrix, 

eo(A) = 1, 
e3(A) = [A], 

e2(A) = ^([A]2-[A2]), 

e3(A) = ^ ([A] ^ - 3 [A] [A^] -I- 2 [ A^] ) , 

e 4 (A) = det(A). (2.26) 


The formulations (2.21) and (2.22) in terms of K = I — g and X = g are both very 
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common in the literature. For reasons partly physical and partly historical, the K formulation 
is more common in massive gravity, while X is predominant in bigravity. The free parameters 
in the two formulations, a„ and are related by [45] 



(2.27) 


Throughout this thesis we will use the X basis and parametrisation except when stated 
otherwise. 

Notice that although these potential terms are very complicated, they have a significant 
amount of structure. This can be better understood by decomposing the metrics into their 
vielbeins, defined by 



(2.28) 


Since vielbeins are, in a sense, the “square roots” of the metrics, and the dRGT potential terms 
are built out of a square root matrix, this is a natural language in which to formulate massive 
gravity. The interaction terms are, up to numerical constants, given by [70] 


eo(X) (X eabcde^^'‘^^E%E\,E‘=aE‘^p, 

ei(X) oc €abcde>^^'‘^^E%E\,E^aL‘‘p, 
e2(X) oc eabcde>^^'‘^^E%E\,L‘^aL‘^p, 
esiX] oc €abcde>^^'‘^^E%L\.L‘=aL‘^p, 
eo{X] oc eabcde>^''’^^L%L\UaL%. 


(2.29) 


We can now understand the simplicity of the dRGT interaction potential: it is a linear com¬ 
bination of the only possible wedge products one can construct from E“ = and 


Finally, notice that we have only coupled matter minimally to the dynamical metric, g^v 
More general matter couplings are certainly possible, but the vast majority of attempts to 
couple the same matter sector to both metrics reintroduce the Boulware-Deser ghost [71- 
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73] . The question of formulating and studying more general matter couplings will form a 
significant part of this thesis, lying at the heart of chapters 5 to 7. 

Hassan-Rosen Bigravity 

In dRGT massive gravity, the reference metric, is fixed and must be put into the theory 
by hand. This leads to a multiplicity of theories of massive gravity: massive gravity around 
Minkowski, around de Sitter, around anti-de Sitter, and so on. As shown by Hassan and Rosen 
in Ref. [74], the reference metric can be freely given dynamics without spoiling the ghost-free 
nature of the theory, as long as its kinetic term is also of the Einstein-Hilbert form. This leads 
to Hassan-Rosen bigravity or massive bigravity, 



(2.30) 


where R[g) and R[f) are the Ricci scalars corresponding to each of the metrics. We have 
allowed for the two metrics to have different Planck masses. Mg and M/, although as long as 
both are finite, they can be set equal to each other by performing the constant rescalings [75] 




(2.31) 


where M* = Mf/Mg. Therefore the /-metric Planck mass is a redundant parameter. We will 
generally perform this rescaling implicitly in later chapters, although for now we will leave 
both Planck masses in to help to elucidate some of the physical features of the theory. 

In terms of free parameters bigravity is simpler than massive gravity: we have traded a 
constant matrix (/^v) for a constant number (M/) which is not even physically relevant. We 
thus need to specify fewer theory ingredients to test its solutions. ' On the other hand, it is 
less simple from the more theoretical point of view that it contains more degrees of freedom. 

^Note however that the /-metric cosmological constant, is physically relevant in higravity hut not in 
massive gravity, as it is independent of g^,. and hence only contributes to the equation of motion for fg,,- 
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The mass spectrum of bigravity contains two spin-2 fields, one massive and one massless. We 
can see this at the linear level by expanding each metric around the same background, g^y, as 


gjiv — g/iv + ^ hj^v> 


n 


flV 




(2.32) 


For simplicity we assume the “minimal model” introduced in Ref. [45], given by 


l3o = 3, A = ^2 = 0, ^3 = 0, A = l- 


(2.33) 


The quadratic Lagrangian is given by [74] 




HR,linear ■ 


4 piv «P ^ 


8 


W, 


Mfj ^ 




(2.34) 


where we have defined Indices are raised and lowered with the background 

metric. We notice the usual Einstein-Hilbert terms for each of the two metric perturbations, 
as well as two Fierz-Pauli terms with some additional mixing between h^y and Z^v This can 
be easily diagonalised by performing the change of variables 


1 _ 1 ^ 1 7 


ZVTeff 


' ^/iv — , , fi//V , , Z 


Mf 


•'flV 




flV • 


The resultant unmrxed Lagrangian, 


(2.35) 




HR,linear 




/iV 


flV 




— m 
8 




(2.36) 


contains a Fierz-Pauli term for V/j^y and no interaction term for U/j^y. Therefore in the linearised 
theory u^y corresponds to a massless graviton and Vf^y to a ghost-free massive one with mass 
m. We can see that in the limit where one Planck mass is much larger than the other, the 
metric with the larger Planck mass corresponds mostly to the massless graviton: if M/ » Mg, 
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then [Ujjiv, V/jy) [Ifivjhny), and similarly if Mg » Mf, then {Ujj,y, v^y) —> [hi^v, — l^y). This 
formalises the notion, which is intuitive from equation (2.30), that we can recover dRGT 
massive gravity hy taking one of the Planck masses to infinity. In that case, the massless 
mode corresponds entirely to the metric with the infinite Planck mass, its dynamics freeze 
out so that it becomes fixed, and the massless mode decouples from the theory, leaving us 
with what we expect for massive gravity.® Note, finally, that the notion of mass is only really 
well-defined around Minkowski space, as it follows from Poincare invariance. More generally 
we can identify modes with a Fierz-Pauli term as massive, hy analogy to the Minkowski case. 
As shown above, one can identify massive and massless linear fluctuations in this way around 
equal backgrounds for a special parameter choice, and indeed this can be done for general 
parameters as long as g^y and are conformally related [71] , but for general backgrounds 
there is no unambiguous splitting of the massive and massless modes in bigravity. 

An interesting and useful property of Hassan-Rosen bigravity is that while g^y and f^y do 
not appear symmetrically in the action (2.30), it nevertheless does treat does metrics on equal 
footing, ignoring the matter coupling. In particular, the mass term has the property 

4 _ _ 4 

(2.37) 

n=0 n=0 

which follows from the identity ■\J-gen g v] = ^4-n (/F gj [74]. This can easily 

be seen by formulating the polynomials in terms of the eigenvalues of X as in equa¬ 
tion (2.26). The result follows from using basic properties of the determinant and the fact that, 
because ^/g~^f and 4 / f~^g are inverses of each other, their eigenvalues are inverses as well. 
As a consequence, the entire Flassan-Rosen action in vacuum is invariant under the exchange 
of the two metrics up to parameter redefinitions, 

g/iv * *flJ,Vy Pn ^ P4-n- (2.38) 

The fact that the matter coupling breaks this duality by coupling matter to only one metric 

®See, however, Ref. [76] for some caveats on taking the massive-gravity limit of higravity. 
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will motivate the search for “double couplings” in later chapters. 

By varying the action (2.30) with respect to the g and / metrics we obtain the generalised 
Einstein equations for massive bigravity [45] , 

3 j 

+ = —(2.39) 

n=0 g 

2 3 

(//^) =0- (2.40) 

* n=0 

where is the Einstein tensor computed for a given metric. The interaction matrices i(„)(X) 
are defined as 


i[o)(X) = I, 


i[i)(X) = X-I[X], 

i[2)(X) = - X[X] + \ ([X]2 - [X^]) , 

i[3)(X) = X^ - X2[X] + ^X ([X]2 - [X^]) 

-^i([X]^-3[X][X2]+2[X^]). (2.41) 

Notice that they satisfy the relation [71] 

n 

i^„)(X) = ^(-l)‘X”-'eKX). (2.42) 


The tensors are symmetric and so do not need to be explicitly symmetrised [71], 

although this fact has gone unnoticed in much of the literature. Einally, is the stress- 
energy tensor defined with respect to the matter metric, g. 


2 5(y-detgi^',^) 


(2.43) 


It is not difficult to check that when = 0, the Einstein equations are symmetric under the 

interchanges (2.38). 
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General covariance of the matter sector implies conservation of the stress-energy tensor 
as in general relativity, 

V^r^v=0. (2.44) 

Furthermore, by combining the Bianchi identities for the g and / metrics with the field 
equations (2.39) and (2.40), we obtain the following two Bianchi constraints on the mass 
terms: 


(vTv) =0, 

n=0 

(2.45) 


(2.46) 


after using equation (2.44). Only one of equations (2.45) and (2.46) is independent: a linear 
combination of the two divergences can be formed which vanishes as an identity, i.e., re¬ 
gardless of whether g^v and satisfy the correct equations of motion [77] , so either of the 
Bianchi constraints implies the other. 


Field Equations for Massive Gravity 

Note that we can also easily obtain the equations of motion for dRGT massive gravity from 
the bimetric equations: the Einstein equation is simply equation (2.39) with fixed to the 
desired reference metric, 

3 ^ 

+ (vrV) = TjrJiv (2.47) 

n=0 

and matter is conserved with respect to g^,, as usual. This quick “derivation” should be 
taken purely as a heuristic—i.e., if we had started off with the dRGT action (2.21) and varied 
with respect to g^y, we would clearly obtain equation (2.47) regardless of whether f^y is 
dynamical—and not as the outcome of a limiting procedure. Indeed, because f^y lacks 
dynamics there is no analogue of its Einstein equation (2.40), and including that equation 
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would lead to extra constraints. It turns out that massive gravity can be obtained as a limit of 
bigravity, but the process is more subtle than simply taking Mf^oo (which freezes out the 
massless mode and equates it with f^v, as discussed above) [76, 78]. Alternatively, the dRGT 
action can be obtained from the bigravity action by taking M/ ^ 0, but to obtain massive 
gravity we must throw away the f^v Einstein equation (2.40) by hand. If we leave it in then it is 
determined algebraically in terms of g^y Plugging this into the mass term we simply obtain a 
cosmological constant; thus this is the general-relativity limit of massive gravity. This agrees 
with the linear analysis above, where we found that in the limit M/ —> 0, the fluctuations of 
g^v become massless. 

2.1.3 Cosmological Solutions in Massive Bigravity 

In this subsection we review the homogeneous and isotropic cosmology of massive bigravity. 
We will follow the framework derived in Refs. [79, 80], and use, with some generalisations, the 
notation and approach summarised in Ref. [2]. As discussed above, we will rescale and 
so that the two Planck masses are equal, Mf — Mg. 

Cosmological Equations of Motion 

We assume that, at the background level, the Universe can be described by Friedmann- 
Lemaitre-Robertson-Walker (FLRW) metrics for both g^y and /^v. Specialising to spatially-flat 
metrics, we have 


ds^—-N{tfdt^ + a{tfdx^, (2.48) 

o 

ds^^ — —X{tfdt^ + Y{tYd^, (2.49) 

where a[t) and Y[t) are the spatial scale factors for g^y and fg^y, respectively, and N{t) and 
X[t) are their lapses. In the rest of this thesis we will leave the time dependences of these 

®This is because we are effectively taking the dRGT action and varying with respect to f^y, treating it like a 
Lagrange multiplier. Hence f^y cannot be picked freely in this case but is rather constrained in terms of g^,, ■ 
i°We thank Fawad Hassan for helpful discussions on these points. 
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functions implicit. We will find it useful to define the ratios of the lapses and scale factors, 

X Y 

x = —, y = —• (2.50) 

N a 

Notice that these quantities are coordinate-independent: while we can freely choose either 
lapse or rescale either scale factor, their ratio is fixed. This is because higravity is still invariant 
under general coordinate transformations as long as the same transformation is acted on 
each metric. 

With these choices of metrics, the generalised Einstein equations (2.39) and (2.40), assum¬ 
ing a perfect-fluid source with density p — - T°o, give rise to two Friedmann equations, 

3H^ = —p + (^0 + 3^iy +3^2y' + ^3y=') , (2.51) 

3K^ = (ySiy + SySzy + SySay + Pa) , (2.52) 

where we have defined the Eluhhle rates as^^ 

H = d/a, K=YIY, (2.53) 

and overdots denote time derivatives. We will specialise in this thesis to pressureless dust, 
which obeys 

p+3Efp=0. (2.54) 

The Bianchi constraint—either equation (2.45) or equation (2.46) —yields 

m^a^p[Xd-NY)^0, (2.55) 


ifin group-theoretic terms, there are two diffeomorphism groups, one for each metric, and bigravity breaks 
the symmetry under each of them but maintains the symmetry under their diagonal subgroup. This is obvious 
from the fact that the mass term only depends on the metrics in the combination g^^/av 

i^ln order to present the cosmological solutions for general lapses, we will define the g-metric Hubble rate 
differently here than in the rest of this thesis; in particular, H is not necessarily the cosmic-time Hubble rate. 
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where we have defined 


p = ^i+2^2y+^3y^ 


(2.56) 

The Bianchi constraint has two branches of solutions: 



Algebraic branch: 

P = 0, 


Dynamical branch: 

Y 

X — 

d 



The algebraic branch is satisfied if ySi + 2yS2y + ySay ^ = 0, which seems to be nongeneric as it 
requires tuned initial conditions. Because the solutions on this branch have y = const., the 
mass term in equation (2.51) clearly reduces to a cosmological constant. Thus the algebraic 
branch, at the background level, is equivalent to ACDM [79, 81]. At the level of linear perturba¬ 
tions, evidence has been found for several modes being strongly coupled [82]. Consequently 
we will focus our attention on the dynamical branch. In this case, the Bianchi constraint 
implies that the ratio of the lapses, x, can be written in terms of other background quantities 
as 


X = 


H ■ 


(2.57) 


The Friedmann equations, (2.51) and (2.52), and the Bianchi identity (2.57) can be com¬ 
bined to find a purely algebraic, quartic evolution equation for y, 

+ (3^2“A)+3 (^1 — p-i)~^P^^ 3^2^ y “^1 = 0. (2.58) 

The g-metric Friedmann equation (2.51) and quartic equation (2.58) completely determine 
the expansion history of the Universe. As in standard cosmology, we see that the cosmic 
expansion is governed by a Friedmann equation. It is sourced by a mass term that depends 
on y, the evolution of which is in turn determined by the quartic equation. 


It will be useful to simplify the dynamics by expressing all background quantities solely in 
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terms of y (t) and then solving for y (a). We can rearrange equation (2.58) to solve for p(y), 


g 


~^3y +(A“3^2)y +3(^3— ^i)y+ 3^2 “^o + ^iy • (2.59) 


We can then substitute this into the Friedmann equation to find H{y], 


(Ay^ + 3yS3y+3yS2 + y3iy“^). 


(2.60) 


By taking a derivative of the quartic equation (2.58) and using the fluid conservation equation 
(2.54) and our solution (2.59) for p(y) we can find an evolution equation for y (a), 

dlny ^ y ^ 3 ^3y^ + (3^2-A)y^+3(A-^3)y^ + (^o-3^2)y - A .2 61) 

din a Hy 3^3y^ + 2(3^2-A)y^+3(A-^3)y^ + A 

Using the definition of y to find y, we can easily write K{y], 

K = H+-. (2.62) 

y 

We can now write any background quantity in terms of y alone, except for y and a themselves, 
and further we have two avenues for determining y (a): integrating equation (2.61), or using 
the quartic equation (2.58) with p = poa~^. These expressions will be crucial throughout this 
thesis since they reduce the problem of finding any parameter—background or perturbation— 
to solving for y (z), where z = l/a-lis the redshift.^^ 

We note briefly that there has been some discussion in the literature over how to correctly 
take square roots in bigravity. There exist cosmological solutions in which det ^g~^f becomes 
zero at a finite point in time (and only at that time), and so it is important to determine 
whether to choose square roots to always be positive, per the usual mathematical definition, 
or to change sign on either side of the point where det ^g~^f — 0. This was discussed in 
some detail in Ref. [83] (see also Ref. [84]), where continuity of the vielbein corresponding 

i^Note that while we have expressed all background quantities in terms of y only, perturbations will in general 
depend on both y and a. 
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to y g~^f demanded that the square root not he positive definite. We will take a similar 
stance here, and make the only choice that renders the action differentiable at all times, i.e., 
such that the derivative of g~^f with respect to g^v and fjiv is continuous everywhere. In 
particular, for the FLRW backgrounds we are dealing with in this section, this choice implies 
that — XV^. This is important because, as we will see in chapter 3, it turns out that 

in the only cosmology with linearly-stable perturbations, the / metric bounces, soX—KY/H 
changes sign during cosmic evolution. With our square-root convention, the square roots 
will change sign as well, rather than develop cusps. Note that sufficiently small perturbations 
around the background will not lead to a different sign of this square root. 


Properties of Bimetric Cosmologies 

We can understand the qualitative behaviour of bimetric cosmologies by taking the early- 
and late-time limits, p ^ oo and p ^ 0, respectively. We will use heuristic arguments to 
motivate results which were determined more rigorously in Ref. [85] and can also be seen 
from a statistical comparison to observations of the expansion history [80]. At early times, 
the quartic equation (2.58) is solved either by y ^ 0 or y —> oo. The former solution is quite 
easy to see: the quartic equation is of the form ... -H py = 0 , where ... contains only positive 
powers of y, so y ^ 0 will clearly be a solution. These are csMed finite-branch solutions. The 
solutions with y ^ oo at early times, or infinite-branch solutions, occur when one of the 
higher powers of y in the quartic equation scales at just the right rate to cancel out the py 
term. These solutions are rather less common; in order to enforce —> 1 and > 0 at early 
times, viable infinite-branch solutions require ^2 = Ps — O and ^^>0 [85]. To see this, notice 
that = N^p/{3M^^H^] can be written in the limit y —> oo, using equations (2.59) and (2.60), 
as 

ySs PI Pi 

^m = l-^y+3^-3^. (2.63) 

Pi Pi Pi 

The condition P 4 > 0 (rather than just P 4 7 ^ 0) arises from demanding, per equation (2.60), 
that be positive at all times. 
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On either branch, at late times y will always flow to a constant, yc, given by the quartic 
equation with p —0, 

P'iYc + (3^2 ~ A) + 3 (^1 — ^ 3 ) + (^0 “ 3 ^ 2 ) yc~ Pi — 0- (2.64) 

Moreover, by taking a derivative of the quartic equation we see that y / 0 unless either p = 0 
or y = 0. Therefore y evolves monotonically throughout cosmic history, flowing either from 0 
up to yc or from 00 down to yc.^'^ 

As long as yc > 0, the mass term asymptotes in the future to a cosmological constant. 
Hence bimetric cosmologies generally possess late-time acceleration with H'^ ~ 

This is the case even if the g-metric cosmological constant, ySo, is turned off, hence these 
theories self-accelerate. Because y cannot be constant in these models,^® the effective dark 
energy is dynamical. In particular, we do not have w — —\ except at the asymptotic future. 
Crucially, the parameters and the potential structure leading to the accelerated expansion 
are thought to be stable under quantum corrections [86], in stark contrast to a cosmological 
constant, which would need to be fine-tuned against the energy of the vacuum [9-11].^® 
Thus we find that bigravity is an excellent candidate for technically-natural self-acceleration. 
Comparisons to background data—specifically the cosmic microwave background, baryon 
acoustic oscillations, and type la supernovae—show that these cosmological models can 
agree well with observations [79, 80, 85]. 

Before ending this subsection, let us consider a worked example: the model with only ySi 
nonzero. Because theoretical viability conditions require this term to be nonzero (ignoring 
the exact ACDM case with — — it is the simplest nontrivial one-parameter model 

which will lead to sensible cosmologies [85]. It has been shown in Refs. [80, 85] that this 

i^Note that these are not necessarily the same yc, as equation (2.64) can have multiple roots. 
i®Unless it is either 0 at all times, which is trivial, or the special case jii— P-i — 0, in which case the Friedmann 
equation can be rewritten, with the help of the quartic equation, as a ACDM Friedmann equation with a rescaled 
gravitational constant [79]. 

matter couples to then matter loops will stUl contribute to m^Po as usual. It is the rest of the dRGT 
potential which is stable under quantum corrections. Consequently we focus on self-accelerating models and 
assume—as is common in the literature—that some unknown mechanism removes the dangerous cosmological 
constant. 
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model provides a self-accelerating evolution which agrees with background cosmological 
observations and, as it possesses the same number of free parameters as the standard ACDM 
model, is a viable alternative to it. Indeed, it may be more viable than ACDM if the graviton 
mass turns out to be stable to quantum corrections, as mentioned above. The graviton mass 
in this case is given by Note that in order to give rise to acceleration at the present era, 

the graviton mass typically should be comparable to the present-day Hubble rate, ySi ~ H^. 

In this simple case, the evolution equation (2.61) for y (a). 


^ r 2 ^ 

d\na V l + 3y^J’ 


(2.65) 


can be integrated exactly to find 



(-C±\/l2a^ + C^y 


( 2 . 66 ) 


Assuming y > 0 forces us to select the positive branch. Using the Friedmann and quartic 
equations, we can set the value of C using initial conditions. 


C = - 




■‘-'o 


I3im 


2 ’ 


(2.67) 


where Hq is the cosmic-time Hubble rate today. Equivalently, we can use the quartic equation 
to solve for y and express the Friedmann equation as a modified expression for H[p), 


g 

In either formulation, the late-time approach to a A-like behaviour is evident. 

The minimal ySi-only model is also distinctive for having a phantom equation of state, 
w{z) Pd -1.22+0 02 “ ;o 4 z/(l -h z) at small redshifts. Moreover, w is related in a simple 

way to the matter density parameter [85] . This provides a concrete and testable prediction 
of the model that can be verified by future large-scale structure experiments, such as Euclid 
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[87, 88], intensity mappings of neutral hydrogen [89, 90], and combinations of structure and 
cosmic microwave background measurements [91]. The model has also been proven in [92] 
to satisfy an important stability bound at all times, avoiding the Higuchi ghost which plagues 
theories of a massive graviton on expanding backgrounds [93]. It is, however, worth noting 
that its linear cosmological perturbations are unstable until z ~ 0.5, as shown in chapter 3 of 
this thesis. We emphasise that this instability does not rule out the ySi-only model, but rather 
impedes our ability to use linear perturbation theory to describe perturbations at all times. 
This raises the interesting question of how to make predictions for structure formation during 
the unstable period, a question which is beyond the scope of this thesis. 

The aforementioned studies have largely been restricted to the background cosmology 
of the theory. As the natural next step, in chapters 3 and 4 we will extend the predictions of 
massive bigravity to the perturbative level, and study how consistent the models are with the 
observed growth of structures in the Universe. 

A No-Go Theorem for Massive Cosmology 

As discussed in the previous section, we can easily obtain the equations of motion for massive 
gravity from those of bigravity; the g-metric equation is the same, and we lose the /-metric 
equation. Note that we also still have the Bianchi constraint (2.55). This fact will turn out to 
be crucial. 

Let us assume that the reference metric is Minkowski, /^^ = Under the assumption 
of homogeneity and isotropy for g^v in unitary gauge, the Friedmann equation is simply 
equation (2.51) with y — a“L This alone would define perfectly acceptable cosmologies. 
However, the Bianchi constraint (2.55) causes trouble. In the bigravity lemguage we now have 
X = T = 1, so the constraint becomes 


m^a^Pd = 0 . 


(2.69) 


Because F = 0, we no longer have an interesting dynamical branch: it merely suggests d = 0. 
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Unfortunately, the algebraic branch, P = 0, also only has a = const, as a solution; in bigravity 
it would fixy = F/a while allowing a and Y to change, but in massive gravity, it is a which 
is fixed. Therefore a is generally fixed to be constant, and this system has no dynamical 
solutions. This is the famous no-go theorem on cosmological solutions in dRGT massive 
gravity, and it is present for both fiat and closed universes [94] . If we instead consider open 
universes or different reference metrics, such as FLRW or de Sitter, then FLRW solutions 
do exist, but they are unstable to the aforementioned Higuchi ghost and other linear and 
nonlinear instabilities [95-100]. 


The search for a viable cosmology with a massive graviton which avoids these conclusions 
has involved two routes. One is to extend dRGT by adding extra degrees of freedom. As 
discussed above, these problems are cured when the second metric is given dynamics. Other 
extensions of massive gravity, such as quasidilaton [101], varying-mass [94, 102], nonlocal 
[60,103,104], and Lorentz-violating [105,106] massive gravity, also seem to possess improved 
cosmological behaviour. The other approach is to give up on homogeneity and isotropy 
entirely. While FLRW solutions are mathematically simple, the Universe could in principle 
have anisotropies which have such low amplitude, are so much larger than our horizon, or 
both, that we cannot easily observe them. Remarkably, these cosmologies are much better 
behaved in massive gravity than is the standard FLRW case [94]. The general scenario of an 
FLRW metric with inhomogeneous Stiickelberg fields has been derived in Refs. [107, 108]. 
This includes, but is not limited to, the case in which the reference metric is still Minkowski 
space, but only has the canonical form = diag(-l, 1,1,1) in coordinates where g^y is 
not of the FLRW form [109]. The inhomogeneous and anisotropic solutions are reviewed 
thoroughly in Ref. [49]. See Ref. [110] for a review of cosmology in massive gravity and some 


of its extensions. 
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2.2 Einstein-Aether Theory 

In the final section of this chapter, we explore a different route to modifying gravity: allowing 
Lorentz symmetry to be violated. This is not a step taken lightly; Lorentz invariance is a 
cornerstone of modern physics. The two theories which have been separately successful at 
predicting nearly all experimental and observational data to date, general relativity to explain 
the structure of spacetime and gravity and the stemdard model of particle physics to describe 
particles and nongravitational forces in the language of quantum field theory, both contain 
Lorentz symmetry as a crucial underlying tenet. 

What do we gain from exploring the breakdown of this fundamental symmetry? Given 
its foundational significance, the consequences of violating Lorentz invariance deserve to 
be fully explored and tested. Indeed, while experimental bounds strongly constrain possible 
Lorentz-violating extensions of the standard model [111], Lorentz violation confined to other 
areas of physics—such as the gravitational, dark, or inflationary sectors—is somewhat less 
constrained, provided that its effects are not communicated to the matter sector in a way that 
would violate the standard-model experimental bounds. Moreover, it is known that general 
relativity and the standard model should break down around the Planck scale and be replaced 
by a new, quantum theory of gravity. If Lorentz symmetry proves not to be fundamental at 
such high energies—for instance, because spacetime itself is discretised at very small scales— 
this may communicate Lorentz-violating effects to gravity at lower energies, which could 
potentially be testable. The study of possible consequences of its violation, and the extent 
to which they can be seen at energies probed by experiment and observation, may therefore 
help us to constrain theories with such behaviour at extremely high energies. 

A pertinent recent example is Hofava-Lifschitz gravity, a potential ultraviolet completion 
of general relativity which achieves its remarkable results by explicitly treating space and time 
differently at higher energies [112]. The consistent nonprojectable extension [113-115] of 
Hofava-Lifschitz gravity is closely related to the model we will explore. Moreover, since we will 
be dealing with Lorentz violation in the gravitational sector, through a vector-tensor theory 
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of gravity, the usual motivations for modifying gravity apply to this kind of Lorentz violation. 
Indeed, there are interesting models of cosmic acceleration, based on the low-energy limit of 
Hofava-Lifschitz gravity, in which the effective cosmological constant is technically natural 
[116,117] . Generalised Lorentz-violating vector-tensor models have also been considered as 
candidates for both dark matter and dark energy [118, 119]. 

Lorentz violation need not have such dramatic, high-energy origins. Indeed, many the¬ 
ories with fundamental Lorentz violation may face fine-tuning problems in order to avoid 
low-energy Lorentz-violating effects that are several orders of magnitude greater than existing 
experimental constraints [120]. However, even a theory which possesses Lorentz invariance 
at high energies could spontaneously break it at low energies, and with safer experimental 
consequences. 

Spontaneous violation of Lorentz invariance will generally result when a field that trans¬ 
forms nontrivially under the Lorentz group acquires a vacuum expectation value (VEV). A 
simple example is that of a vector field whose VEV is nonvanishing everywhere. As mentioned 
above, in order to avoid tbe experimental constraints sucb a vector field should not be cou¬ 
pled to the standard model fields, but in order to not be completely innocuous we will ask 
it to couple to gravity. Moreover, to model Lorentz violation in gravity without abandoning 
the successes of general relativity—in particular, without giving up general covariance—the 
(spontaneously) Lorentz-violating field must be a spacetime vector and must be dynamical. 

A particularly simple, yet quite general, example of a model with these features is Einstein- 
aether theory (ae-theory) [121,122]. It adds to general relativity a dynamical, constant-length 
timelike vector field, called the aether and denoted by which spontaneously breaks 
Eorentz invariance by picking out a preferred frame at each point in spacetime while main¬ 
taining local rotational symmetry, thus breaking only tbe boost sector of Eorentz symmetry 
[122,123]. The constant-length constraint plays two crucial roles. The first is phenomenologi¬ 
cal: it ensures that the aether picks a globally-nonzero VEV and so guarantees that Lorentz 
symmetry is in fact broken. The other role is to ensure that the theory is not sick: if the length 

^^The requirement that the field be dynamical stems from the fact that there is no nontrivial (i.e., nonzero) 
spacetime vector which is covariantly constant, i.e., if = 0 everywhere then necessarily u'' — 0. 
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is not fixed and the kinetic term is not gauge-invariant^® then the length-stretching mode 
has a wrong-sign kinetic term and hence is ghostlike [124]. Note that ae-theory is the most 
general effective field theory in which the rotation group is unbroken [125], and hence it can 
be seen as the low-energy limit of any theory which violates boosts but maintains rotational 
symmetry. 


2.2.1 Pure Aether Theory 

Einstein-aether theory (which we will often refer to as “pure” Einstein-aether theory or ae- 
theory) is a theory of the spacetime metric g^v and a vector field (the “aether”) uf^. The action 
is [122,126] 




(2.70) 


where we have defined 


= Cig^^^'gpo- + -H CiU^u^'gp^. (2.71) 

The action (2.70) contains an Einstein-Hilbert term for the metric, a kinetic term for the 
aether with four dimensionless coefficients c, (coupling the aether to the metric through the 
covariant derivatives), and a nondynamical Lagrange multiplier A. Varying this action with 
respect to A constrains the aether to be timelike with a constant norm, — —m^. The 
aether has units of mass; its length, m, has the same dimensions and corresponds to the 
Lorentz symmetry breaking scale. 

The action (2.70) is the most general diffeomorphism-invariant action containing the 
metric, aether, and up to second derivatives of each. Higher derivatives are excluded because 
they would generically lead to ghostlike degrees of freedom. Most terms one can write down 
involving the aether are eliminated by the fixed norm condition. One could consider a 

i^Note that gauge invariance would uniquely pick out the Maxwell term, in which case we would simply have 
electromagnetism which clearly does not spontaneously break Lorentz symmetry. 
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term , but this is equivalent under integration by parts to - 

(V^ Mv)(V^ k’'), where F^,, = 2 V[^ Uy] is the field strength tensor, and so is already included in 
the ae-theory action [ 121 ]. In what follows we will follow much of the literature on aether 
cosmology (e.g., Refs. [126,127]) and ignore the quartic self-interaction term by setting C 4 = 0. 

It is generally assumed that (standard-model) matter fields couple to the metric only. 
Any coupling to the aether would lead to Lorentz violation in the matter sector by inducing 
different maximum propagation speeds for different fields, an effect which is strongly con¬ 
strained by experiments [111]. These problematic standard-model couplings may, however, 
be forbidden by a supersymmetric extension of ae-theory [128]. The work on ae-theory which 
we detail in chapter 8 will be interested in exploring and constraining Lorentz violation in 
the gravitational sector and in a single non-standard-model scalar, hence we will not need to 
worry about such a coupling. 

The gravitational constant G that appears in equation (2.70) is to be distinguished from the 
gravitational constants which appear in the Newtonian limit and in the Friedmann equations, 
both of which are modified by the presence of the aether [126]. The Newtonian gravitational 
constant, Gn, and cosmological gravitational constant, Gc, are related to the bare constant G 
by 


G 


N — 


Gc^ 


G 

l+8nG5’ 

G 

1 +87rGa’ 


(2.72) 

(2.73) 


where 


5 = —Cim^, (2.74) 

a = (ci3-|-3c2)m^. (2.75) 


We have introduced the notation C 13 = Ci + C 3 , etc., which we will use throughout. 
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2.2.2 Coupling to a Scalar Inflaton 

We now introduce to the theory a canonical scalar field (f) which is allowed to couple kinetically 
to the aether through its expansion, 0 = 'Vj^uf^ [129], The full action reads 




. (2.76) 


Let us pause to motivate the generality of this model. Our aim in chapter 8 will he to constrain 
couplings between a Lorentz-violating field and a scalar, in particular a canonical, slowly- 
rolling scalar inflaton, in as general a way as possible. As mentioned above, Einstein-aether 
theory is the unique Lorentz-violating effective field theory in which rotational invariance is 
maintained [125],^® so any theory which spontaneously violates Lorentz symmetry without 
breaking rotational invariance will be described by the vector-tensor sector of our model at 
low energies. As for the scalar sector, the main restriction is that we have assumed a canonical 
kinetic term. While there are certainly coupling terms between the aether and the scalar which 
do not fall under the form V{d,4)], all of these terms have mass dimension greater than 4 and 
so are irrelevant operators. Such terms are nonrenormalisable. While this is not necessarily 
disastrous from an effective field theory perspective, these terms are also nevertheless mostly 
important at short distances, and so should not factor into the cosmological considerations at 
the heart of this thesis. To see that all terms with dimension 4 or less fall into the framework 
(2.76), notice that the aether, scalar, and derivative operators all have mass dimension 1, 
the aether norm is constant so cannot be used in the coupling, and the aether and 
derivative operators carry spacetime indices which need to be contracted. Subject to these 
constraints, one can see that any terms which involve both and (j) and have dimension 4 
or less are either of the form f[0,(p) or can be recast into such a form under integration by 
parts. In particular, the only nontrivial interaction operators which are not irrelevant are (p 0 
(dimensions) and^^0 (dimension4). 

This type of coupling was originally introduced with a more phenomenological motivation 

i®However, we note that there is an allowed term, the quartic self-interaction parametrised by C 4 , which we 
have turned off. 
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[129]. In a homogeneous and isotropic background, the aether aligns with the cosmic rest 
frame, so 0 is essentially just the volume Hubble parameter, 0 —3m H. Hence the introduction 
of the aether allows a scalar inflaton to couple directly to the expansion rate. This is impossible 
in GR where H is not proportional to any Lorentz scalar. 

The aether equation of motion, obtained by varying the action with respect to is 

(2.77) 

where the current tensor is defined by 


JF^-K^^\pVvuP, 


(2.78) 


and we are denoting partial derivatives of the potential by V© = dV/dO and = dV/d(j). 
Projecting this equation along allows us to obtain the Lagrange multiplier A, 




m 


2m^ 


(2.79) 


We will account for the modification to gravity by leaving the Einstein equations in the 
standard form (1.6) and defining a stress-energy tensor for the combined aether-scalar system. 
Taking into account the contribution from the Lagrange multiplier term, this is given by 


*/iV 


5^ „ 5^ 

■ 2- -h uP-—-UfiUy -S£ 


5g 


(jLV 


5uP 


fIV 


(2.80) 


where if is the Lagrangian for the aether and scalar. Using this formula we find the stress- 
energy tensor. 


r^v = 2ci(V^u^Vv Up - VP UpVp Uy) 

-2[VpiU^pjPy)) + VpiuPFy))-VpiUlpJyf)] 
-2m^^UiyV pFP UpUy + gpyS£u 
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+ + v-eve^ g^v 

+ {uf’VpVe){gi,v + m~^Ui,Ur], (2.81) 

where S^u = po-’VpUP'VyU^^ is the Einstein-aether Lagrangian. Finally, the inflaton obeys 
the usual Klein-Gordon equation, 

ncj)^V^. (2.82) 

Notice that while this equation has the standard form, it couples the scalar to the aether since 
generally we will have V^ — V^{6,(p). 

Note that the equations of motion for the pure ae-theory follow simply hy setting cj) — Q 
and V{6— 

2.2.3 Einstein-Aether Cosmology 

In this section we examine the evolution of FLRW cosmological solutions in Einstein-aether 
theory. Consider a flat FERW background geometry evolving in conformal time, t. 


ds^— a^{T){-dT^ + dx^). (2.83) 

In pure ae-theory, we take the 0-0 and trace Einstein equations to obtain the Friedmann 
equations. 




(2.84) 

(2.85) 


where ^ = a'/a — dlna/dz is the conformal time Hubble parameter. These are exactly the 
Friedmann equations of general relativity except that, as discussed above, the bare cosmologi¬ 
cal constant, G, is renormalised, 

G 

l+87rGa’ 


( 2 . 86 ) 
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with a = (ci + 3 c2 + € 3 ) 171 ^. The aether does not change the cosmological dynamics at all, but 
just modifies the gravitational constant. This arises because in a homogeneous and isotropic 
background the Einstein-aether terms for the vector field only contribute stress-energy that 
tracks the dominant matter fluid, so the associated energy density is proportional to FP 

[126] .20 

The aether does contribute dynamical stress-energy once we couple it to a scalar. In the 
theory (2.76) the Friedmann equations are 




8nGc , 

- a 

3 

TttGc 2 
- a 


^V-eVe+pm + ^fp'^a 


(2.87) 

- Pm{l+8w] + 2{V- 9Vg]-2(j)'^a~^ , 


( 2 . 88 ) 


For completeness we have included a matter component, but in the rest of this section and in 
chapter 8 we will assume that (j) is gravitationally dominant and ignore any p^. The scalar 
field obeys the usual cosmological Klein-Gordon equation. 


(j)" + 2M’(j)' + a^V^^0. 


(2.89) 


As discussed above, the coupling to 9 is contained in the function 1^. In the background, 
9 — 3mH, with H — ^/a the cosmic-time Hubble parameter, so this contributes either 
Hubble friction or a driving force [129]. 

We need not write down the aether field equations in the background. The vector field 
must be aligned with the cosmic rest frame due to homogeneity and isotropy, and its value, 

(2.90) 

a 

is determined completely by the normalisation condition, = —m^. One can check 
that this solution satisfies the spatial component of the aether equation, while the temporal 


2°Note, however, that perturbations of the aether do carry some dynamics [127]. 
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component only determines the Lagrange multiplier. In pure ae-theory this solution is stable 
perturbatively [127,130-132] and that stability holds nonlinearly for most large perturbations 
[133]. This statement is subject to several constraints on the c, parameters which can be 
found in, e.g.. Refs. [122, 127, 132], and we will assume throughout this thesis that these 
constraints hold. One of the important results in chapter 8 is that the coupling between 
and (j) can render cosmological solutions unstable for large regions of parameter space that 
are allowed by other experimental, observational, and theoretical constraints. 

When the scalar potential is V[0,(p) — Vicp), the background aether is irrelevant apart 
from rescaling Newton’s constant, and many choices for the potential can lead to periods of 
slow-roll inflation [134] . Adding a coupling to the aether will change the dynamics but may 
still allow for inflation [129]. We will therefore aim to be as general about V[0,(p) as possible 
when discussing perturbation theory. 




Parti 

A Massive Graviton 


It was therefore quite a shock when he said, “But why should anybody be interested 
in getting exact solutions of such an ephemeral set of equations?" I remember very 
well this word “ephemeral.” It meant that he did not consider his gravitational 
equations as the last word. 


Cornelius Lanczos, Einstein: The Man and His Achievement 




There is nothing stable in the world; uproar’s your only music. 


John Keats, Letters 


Cosmological Stability of Massive Bigravity 



In the previous chapter, and in particular in section 2.1, we discussed an approach to modi¬ 
fying gravity in which its force-carrier particle, the graviton, is given a small mass. In partic¬ 
ular, by specialising to the dRGT interaction potentials (2.22) we ensure that the notorious 
Boulware-Deser ghost mode is absent, and by allowing both metrics to be dynamical and 
taking the graviton mass to be of the order of the present-day Hubble rate, we can obtain 
cosmological solutions which agree well with observations of the cosmic expansion history. 
These solutions are self-accelerating: the Hubble parameter goes to a constant at late times 
even in the absence of a cosmological constant. The action of this bimetric theory, or bigravity, 
is given by equation (2.30), and the associated modified gravitational field equations were 
presented as equations (2.39) and (2.40). 

We have discussed work comparing these FLRW solutions to tracers of the expansion his¬ 
tory, most notably in Refs. [79, 80, 85] . The natural next step is to move beyond the assumption 
of homogeneity and isotropy and allow for linear perturbations to FLRW which could describe 
the formation of large-scale structure. In particular, by employing the frequently-used sub¬ 
horizon and quasistatic approximations we can dramatically simplify the complicated system 
of perturbed field equations while still capturing most observable linear modes. Indeed, this 
is precisely what we will do in chapter 4. 
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The quasistatic limit is, however, a valid approximation only if the full system is stable for 
large wavenumbers. Previous work [82, 135,136] has identified a region of instability in the 
past.^ The aim of this chapter is to investigate this problem in detail. 

The rest of this chapter is organised as follows. In section 3.1 , we present the linearised Ein¬ 
stein and fluid conservation equations in massive bigravity, which are derived in appendix A, 
present a scheme for counting the number of independent, dynamical degrees of freedom, 
and then use that counting to pick a useful gauge. Putting this all together, we discuss how to 
reduce the system of ten Einstein and conservation equations to two equations for the two 
independent fields. In section 3.2, we solve these equations when the background can be 
assumed to be slowly veirying, which is valid on small scales, and obtain the eigenfrequencies 
for the various bimetric models. With these in hand, we analytically determine the epochs 
of stability and instability for all the models with up to two free parameters which have 
been shown to produce viable cosmological background evolution. Tbe behaviour of more 
complicated models can be reduced to these simpler ones at early and late times. 

We show that several models which yield sensible background cosmologies in close 
agreement with the data are in fact plagued by an instability that only turns off at recent times. 
This does not necessarily rule out these regions of the bimetric parameter space, but rather 
presents a question of how to interpret and test these models, as linear perturbation theory is 
quickly invalidated. Remarkably, we find that only a particular bimetric model—in which only 
the Pi and parameters are nonzero (that is, the linear interaction and the cosmological 
constant for the reference metric are turned on)—is stable at all times when the evolution is 
on a particular branch. This shows that a cosmologically-viable bimetric model without an 
explicit cosmological constant does indeed exist, and raises the question of howto nonlinearly 
probe other corners of bigravity. We summarise and discuss our results in section 3.3. 


^This should not be confused with the Higuchi ghost instability, which affects most massive gravity cosmolo¬ 
gies and some in bigravity, but is, however, absent from the simplest bimetric models which produce ACDM-like 
backgrounds [92]. 
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3.1 Linear Cosmological Perturbations 

In this section we set up the formalism for cosmological perturbation theory in massive hi- 
gravity. We define the scalar perturbations to the FLRW metrics by extending equations (2.48) 
and (2.49) to^ 

= -N^[l + Eg)dt^ +2NadiFgdtdx‘ +a^ [(1 +Ag]5ij + didjBg^ dx‘dx^, (3.1) 

dsj^-X\l + Ef)dt^ + 2XYdiFfdtdx^+ Y^ [il+Af)5ij+ didjBf] dx'dx^, (3.2) 

where the perturbations {Egj,Agj, Fgj, Bgj] are allowed to depend on both time and space. 
Spatial indices are raised and lowered with the Kronecker delta. The stress-energy tensor is 
defined up to linear order by 


T°o — ^p(l + d), 
ro^-{p+P)v\ 
T\^[p + P)[vi + diFg), 



(3.3) 


where 5 is the density contrast, = dx^ld t is the 3-velocity, and T,‘j is the anisotropic stress, 
with S' ; = 0. We specialise immediately to dust {P — 5P — T}j — 0) and define the velocity 
divergence, 6 = diV\ 

3.1.1 Linearised Field Equations 

The linearised Einstein and conservation equations are arrived at by a fairly lengthy computa¬ 
tion. We summarise the results here; details on the derivation can be found in appendix A. 
The g-metric Einstein equations are 

^By leaving the lapse N in the g metric general, we retain the freedom to later work in cosmic or conformal 
time. There is a further practical benefit: since this choice makes the symmetry between the two metrics 
manifest, and the action is symmetric between g and / as described in section 2.1.2, this means the / field 
equations can easily be derived from the g equations by judicious use of ctrl-f. 
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• 0 - 0 : 


3H 

w 



4)+v^ 


-4+h 


2 F^ 

Na 



+ ^yP(3AA + V2AB) = 

(3.4) 


0 -/: 


^ 5 , (. 4 , - HE,) + {xFf-yF,)= ^ST\. 


M2 

g 


(3.5) 


i-i: 


1 


( 


LV 


„ N 

2H + 3H^-2—H 
N 


+m 


Eg + El Eg Ag 3HAg + 
xPAE + yQ rAA + ^ 




(3.6) 


• Off-diagonal i-j: 

1 tn^ 1 

- 2 ^‘^.Of: - = -^d r,. (3.7) 

where H = a/a is the usual g-metric Huhhle parameter (in cosmic or conformal time, de¬ 
pending on N), dj- + in the i-i spatial Einstein equation refers to derivatives with respect 
to the other two Cartesian coordinates, i.e., V2 - d‘di where the i indices are not summed 
over, and we have defined 


P = ^1+ 2^2y + 

(3.8) 

Q = -H (x -fy) Pz + xy Ps, 

(3.9) 

x = X/N, 

(3.10) 

y = Yla, 

(3.11) 

1 

III 

<1 

(3.12) 

AB _ Bf Bg, 

(3.13) 

AE = Ef — Eg, 

(3.14) 
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as well as 


+ 3BA 2F„ if.. N. 

D„= ^ —-^ + —-r - 

^ a2 iV ' a N ] Na N^ \ ^ N ^ 


N y a N ^ 

The linearised Einstein equations for the / metric are 


(3.15) 


• 0 - 0 : 


3K 


[ke,-A,)+w^ 


^a-kI 

\XY 


P 


(sAA + V^AB) =0, (3.16) 


• 0-F 


Yg^{Af-KE,) + 


P I a 
Ml y2 X +y X 


di (yFg - xFf) = 0, (3.17) 


i-r. 


1 


{ 


^K + 3K^-2^Kyf + KEf-Af-3KAf + ^Af 

1 


+ 2 




Df 


Ml xy^ 


^FAF + Q j^AA + i ^ df) AB 


= 0, (3.18) 


Off-diagonal i-j: 


1 tn^ O 

-d^diDf + - -^d^djAB^Q, 

2 ^ ^ 2M2xy2 ^ 


(3.19) 


where K=Y/Y is the /-metric Huhhle parameter and we have again defined 


Af + Ef Kf4Ef 3Bf\ 2Ef if.. X. 

Df= ^ , +— —- -\+ — r Bf--Bf 

^ Y^ X \ Y X XY X^\ ^ X ^ 


(3.20) 


Finally the fluid conservation equation, F^^v = 0, can be split into time and space parts, 
neither of which is changed from general relativity, 

• Energy conservation (v = 0): 


V^r^ = -(jd+3Hp)(l + 5)-p 


5 + e + -A^ + -X^B 
2 2 


(3.21) 
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• Momentum conservation (v = i): 


= [p +4Hp) [vi + diFg) +p [vi + diFg) + ^pdtEg. (3.22) 

We have assumed for simplicity that the fluid comprises only pressureless dust. These are 
in agreement with the results found elsewhere in the literature using various choices of 
gauge-invariant variables [3, 75, 82]. 

It is worth mentioning that the g-metric i-i equation, (3.6), is identically zero in GR after 
taking into account the 0 -i, off-diagonal i-j, and momentum conservation equations and 
hence gives no information; in massive (hi) gravity, however, it is crucial, and is manifestly 
only important when m / 0. In a gauge with Fg = Ff — Oit has the simple form 

m^^P^xEf-yEg^+2yQAA^—0. (3.23) 

Performing the same steps on the /-metric i-i equation, we arrive again at equation (3.23). 
Hence both i-i equations carry the same information. We see there is an extra algebraic 
constraint hidden in the system of perturbation equations; this is closely related to the 
nontrivial constraint which eliminates the Boulware-Deser ghost,^ and will become important 
shortly when discussing the degrees-of-freedom counting in bigravity. 

Herein we will decompose the perturbations into Fourier modes without writing mode 
subscripts: every variable will implicitly refer to the Fourier mode of that variable with 
wavenumber k. 

3.1.2 Counting the Degrees of Freedom 

While we have ten equations for ten variables, there are only two independent degrees of 
freedom.^ These can be seen as corresponding, for example, to the scalar modes of the two 
gravitons or to the matter perturbation and the scalar mode of the massive graviton. To 


^We thank Shinji Mukhoyama for discussions on this point. 

'‘The discussion in this section is indebted to useful conversations with Macarena Lagos and Pedro Ferreira. 
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understand the degrees-of-freedom counting, we will start with the simpler waters of general 
relativity, using the language we have employed for bigravity and following the spirit of the 
discussion in Ref. [137]. The time-time and time-space perturbations. Eg and Eg, as well as 
the velocity perturbation, 0 , are auxiliary in that they appear in the second-order action 
without derivatives.® Therefore their equations of motion are algebraic constraints which 
relate them to other perturbation variables, and they can be removed from the system trivially. 
This leaves us with three dynamical variables, Ag, Bg, and 5, two of which can be gauge 
fixed, or set to a desired value (such as zero) by a coordinate transformation. Hence at linear 
order general relativity only has one dynamical (scalar) degree of freedom propagating on 
FLRW backgrounds. In inflation, for instance, this is often taken to be the comoving curvature 
perturbation, 

This is relatively straightforward to extend to massive bigravity; we point the reader to 
Refs. [137, 138] for in-depth discussions. Few complications are introduced because the only 
new components of the theory are an Einstein-Hilbert term for which has exactly the 
same derivative structure as in general relativity, and a mass term, which has no derivatives. 
Therefore we can see immediately that five of the perturbations— Eg, Ef, Eg, Ef, and 0 — 
are nondynamical and can be integrated out in terms of the dynamical variables and their 
derivatives. As discussed in section 2.1.3, the coordinate invariance in massive bigravity is 
effectively the same as in general relativity, as long as we view the gauge transformations 
as acting on the coordinates, rather than on the fields. This can be seen by the fact that 
the Einstein-Hilbert terms are clearly invariant under separate diffeomorphisms for g^v and 
fur, and the mass term is invariant as long as g~^f is, which is the case if we act the same 
coordinate transformation on each of them. We can therefore gauge fix two of the dynamical 
variables. 

Because we have started with ten perturbation variables, five of which were auxiliary and 
two of which can be gauge fixed, we are now left with three dynamical variables. However, 
they are not all independent. After the auxiliary variables are integrated out, one of the 

®Specifically, they appear without time derivatives. Recall that we are working in Fourier space where spatial 
derivatives effectively amount to multiplicative factors of ik. 
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initially-dynamical variables becomes auxiliary, i.e., its derivatives drop out of the second-order 
action, and it can itself be integrated out. This leaves us, as promised, with two independent, 
dynamical degrees of freedom. 

It is therefore possible to reduce the ten linearised Einstein equations to a much simpler 
system of two coupled second-order differential equations. As we will see, with the right 
choice of gauge this process is fairly simple. This will allow us, in section 3.2, to check whether 
the solutions to that system are stable. 

3.1.3 Gauge Choice and Reducing the Einstein Equations 

In Ref. [137] a method for identifying the gauge-invariant degrees of freedom was presented 
in which Noether identities are used to identify “good” gauges, i.e., gauges in which the 
equations of motions for the gauge-fixed variables are contained in the remaining equations 
of motion. This methodology was applied to massive bigravity in Ref. [ 138] . While the method 
was developed with a focus on deriving the second-order action for the perturbations, rather 
than starting with the equations of motion as we do here, we will find that this method of 
picking a gauge will be convenient. 

Many common gauges choose to fix auxiliary variables, but this makes the job of reducing 
the perturbation equations to the minimal number of degrees of freedom difficult. By contrast, 
in the Noether-identity method only dynamical variables are fixed. Specifically, one chooses 
to eliminate those variables whose equations of motion are redundant, i.e., are contained 
within the equations of motion for fields which we leave in, so that no physical information 
is lost. Because the equations of motion for the perturbation variables are the same as the 
Einstein equations we are using, such a gauge choice works well for our purposes. The 
end result is that we should choose to eliminate one of {Ag,Af} and one of {Bg,Bf,x] [138], 
where x = + {3/2]k~^Ag - [l/2]Bg, which characterises the fluid flow, is the basic scalar 

dynamical degree of freedom for a perturbed fluid [139]. This uses up all of the available 
gauge freedom. 

In this chapter we will work in a gauge in which Af — x — 0. This has the advantage of 
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treating the two metrics symmetrically: the remaining independent, dynamical fields are Bg 
and Bf, with Ag having become auxiliary in the process. Our goal is to derive the reduced 
system of equations of motion for Bg and Bf, as well as expressions relating all of the rest 
of the perturbation variables to these. Because the resultant equations are extraordinarily 
lengthy, we will not present them but will simply summarise the steps. 

Five equations—the 0-0 and 0-i Einstein equations for each of the metrics and the energy- 
conservation equation—correspond directly to the equations of motion for the five auxiliary 
variables. In these equations the auxiliary variables only appear linearly and without deriva¬ 
tives. Therefore we can easily “integrate them out” by solving the system of those five equa¬ 
tions to obtain {Eg,Ef, Eg, Ef, 0} in terms of the remaining degrees of freedom, [Ag, Bg, Bf}, 
and their derivatives. 

We are left with Ag, Bg, and Bf, and their equations of motion are the g-metric i-i, g- 
metric i-j, and /-metric i-j equations, respectively. As discussed above, the i-i equations 
effectively become constraints after manipulation with the other equations of motion. We 
demonstrated this in a gauge where Eg — Ef — 0; while mathematically simple, this gauge 
is not very helpful as it only eliminates auxiliary variables. In the more convenient gauge 
we are now using there is an equivalent statement: after integrating out the five auxiliary 
variables, all derivatives of Ag vanish from the g-metric i-i equation, so that it can be used 
to solve algebraically for Ag in terms of Bg, Bf, and their first derivatives. This is the result, 
mentioned above, that after integrating out the auxiliary variables, one of the dynamical 
variables becomes auxiliary. We note that Ag only depends on Bg and 6/ up to first derivatives. 
This fact is crucial because Ag (though not Ag) appears in the remaining equations of motion. 
If Ag depended on second derivatives, then higher derivatives would appear upon integrating 
it out, and we would be in danger of a ghost instability. Indeed, as mentioned above, the fact 
that Ag loses its dynamics is nothing other than the Boulware-Deser ghost being rendered 
nondynamical by the specific potential structure of massive bigravity [140]. 
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3.2 Stability Analysis 


Having reduced the system of linearised Einstein and conservation equations using the 
steps outlined in section 3.1.3, we can write our original ten equations as just two coupled 
second-order differential equations. Defining the vector 


f \ 

Bs 


yBfj 


(3.24) 


this system takes the simple form 


x-l-Ax-l-Sx = 0, 


(3.25) 


where A and B are matrices with extremely unwieldy forms which depend only on back¬ 
ground quantities and on k. Equivalently we can write equation (3.25) in the same form in 
terms of iV = Ina (not to he confused with the g-metric lapse, N), 

x" + A:^ + Bx^0, (3.26) 


where we use primes to denote derivatives with respect to N. We will choose this formulation 
to simplify the analysis and better understand its physical consequences. 

We are now in a good position to analytically probe the stability of linear cosmological 
perturbations in bigravity. If we neglect the dependence of A and B and treat them as 
constants, then equation (3.26) is clearly solved by a linear superposition of exponentials 

x = ^x,.e'"‘^. (3.27) 

i 

We refer to as eigenfrequencies because they can be determined by solving for the eigenval¬ 
ues oficOiA + B. While A and B are not truly constant, they do in fact vary slowly enough 
for the WKB approximation to hold, in which case equation (3.27) is the correct first-order 
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solution for x. The criterion for the WKB approximation to hold is \ co'/co'^\ <. 1; this will he 
satisfied in the cases in which we are interested. 

The criterion for stability is that all eigenfrequencies he real. This is necessary to obtain 
purely oscillating solutions for the perturbation variables; if any eigenfrequency had an 
imaginary piece, then there would be exponentially growing modes. The eigenfrequencies we 
obtain from equation (3.26) are inordinately complicated.® To simplify the analysis and focus 
on subhorizon scales, which account for most of the modes we can observe, we will take the 
limit k » where, as elsewhere in this thesis, we denote the conformal-time Hubble rate by 

As discussed in section 2.1.3, the only theory with one nonzero yS, parameter which allows 
for viable cosmological background expansion is the ySi model, providing an excellent fit 
to background data including Type la supernovae, the cosmic microwave background, and 
baryon-acoustic oscillations [80,85]. However, the analysis in this chapter shows that it suffers 
from instabilities throughout most of cosmic history. In the limit of large we find the 

eigenfrequencies for this model are 


_ k V-l + 12y2 + 9y4 

l+3y2 


(3.28) 


The condition for stability, i.e., for the object inside the square root to be positive, is 


y > 



(3.29) 


This suggests that there is an instability problem at early times; recall from section 2.1.3 that 
the ySi model has only finite-branch solutions, meaning that y evolves monotonically from 0 
at early times to, at late times, a positive constant. Therefore the ySi model always suffers from 
an instability at early times, with a turnover from unstable to stable occurring when y 0.28. 
This instability is quite dcingerous. Consider scales of A: ~ lOOJ^f, which is a typical mode size 

®We recognise that the number of unused synonyms for “these equations are very long” is growing short as 
this chapter progresses. 
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for structure observations. Those modes would then grow, assuming Im(<u) ~ ^(1), as roughly 
giooiv^ which is far too rapid for linear theory to be applicable for more than a fraction of an 


e-fold. 


During what time period is this instability present? If y reached 0.28 at sufficiently early 
times, one might expect that the presence of radiation or other new fields, which we have 
ignored in favour of dust, could ensure that modes me stable. However, the time at which the 
instability turns off is generically close to the time at which the expansion begins to accelerate, 
so this is clearly a modern problem. We can find the exact region of instability recalling that, 
in this model, we can solve for y (a) exactly, c.f. equation (2.66), 



(3.30) 


where 



(3.31) 


where Bi = and Ho is the cosmic-time Hubble rate today. The best-fit value for Bi 

using a combination of SNe, CMB, and BAD data is Bi — 1.448 ± 0.0168 [2, 80]. For Bi = 1.448 
exactly, cop^ switches from imaginary to real at iV = -0.49, corresponding to a relatively recent 
redshift, z = 0.63435. This number is fairly sensitive to the choice of datasets. The CMB 
and BAO data are taken from observations which assume general relativity in their analysis. 
Restricting the analysis to supernovae alone, the best fit is Bi = 1.3527 ± 0.0497. In this case, 
the instability ends at iV = -0.38 or z = 0.47. At any epoch before this, the perturbation 
equations are unstable for large k. This behaviour invalidates linear perturbation theory on 
subhorizon scales and may rule out the model if the instability is not cured at higher orders. 
This is not necessarily out of the realm of possibility. As discussed earlier, massive bigravity 
possesses the Vainshtein mechanism, in which nonlinear effects suppress the helicity-0 mode 
of the massive graviton in dense environments, thereby recovering general relativity [54, 55] . 
It may be the case that such a mechanism will also impose general-relativistic behaviour on 
nonlinear cosmological perturbations. 
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Now let us move on to more general models. As we mentioned in section 2.1.3, the 
other one-parameter models are not viable in the background,' i.e., none of them has a 
matter-dominated epoch in the asymptotic past and produces a positive Hubble rate [85].® 
Nevertheless it is worthwhile to calculate the eigenfreqencies in these cases in order to study 
the asymptotic behaviour of the viable multiple-parameter models. For simplicity, from now 
on we refer to a model in which, e.g., only ySi and ^2 are nonzero as the P 1 P 2 model, and so on. 

At early times, every viable, finite-branch, multiple-parameter model is approximately 
described by the single-parameter model with the lowest-order interaction. For instance, 
the P 1 P 2 , P 1 P 3 , and P 1 P 2 P 3 models all reduce to ySi, the P 2 P 3 model reduces to P 2 , and so on. 
Similarly, in the early Universe, the viable, infinite-branch models reduce to single-parameter 
models with the highest-order interaction. This is clear from the structures of the terms in 
the Friedmann equation and the P and Q parameters introduced above. It is only through 
these terms that the yS, parameters enter the perturbation equations. Therefore, in order 
to determine the early-time stability, we need only look at the eigenfrequencies of single¬ 
parameter models. In addition to cop^ presented in equation (3.28) above, we have 


k 1 

(3.32) 

k ^-3-H8y2-y4 


v^(i-y2) ’ 

(3.33) 

k 1 



(3.34) 


We see that the P 2 and models are stable at all times, while the ySa model suffers from an 
early-time instability just like the ySi model. We can now extend these results to the rest of 
the bigravity parameter space by using the single-parameter models to test the early-time 
stability. 

Since much of the power of massive bigravity lies in its potential to address the dark energy 
problem in a technically-natural way, let us first consider models without an explicit g-metric 


^With the exception of the ySo model, which is simply ACDM. 

® We frequently discuss the viability of various models in this section; all such results were derived in Ref. [85] . 
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cosmological constant, i.e., ySo = 0. On the finite branch, all such models with ySi / 0 reduce, 
at early times, to the ySi model. As we have seen, this possesses an imaginary sound speed 
for large k, cf. equation (3.28), and is therefore unstable in the early Universe. Hence the 
finite-branch P 1 P 2 P 3 P 4 model and all its subsets with ySi 7 ^ 0 are all plagued by instabilities. 
This is particularly significant because all of these models otherwise have viable background 
evolutions [85]. This leaves the P 2 P 3 P 4 model; this is stable on the finite branch as long as 
P 2 7 ^ 0, but its background is not viable. We conclude that there are no models with ySo = 0 
which live on a finite branch, have a viable background evolution, and predict stable linear 
perturbations at all times. 

This conclusion has two obvious loopholes: we can either include a cosmological constant, 
Po, or turn to an infinite-branch model. We first consider including a nonzero cosmological 
constant, bearing in mind this may not be as interesting theoretically as the models which 
self-accelerate. Adding a cosmological constant can change the stability properties, although 
it turns out not to do so in the finite-branch models with viable backgrounds. In the ySoySi 
model, the eigenfrequencies. 



(3.35) 


l+3y 


are unaffected by Po at early times and therefore still imply exponential mode growth in the 
asymptotic past. This result extends (at early times) to the rest of the bigravity parameter 
space with Po, Pi / 0. No other finite-branch models yield viable backgrounds. In conclusion, 
all of the solutions on a finite branch, for any combination of parameters, are either unviable 
(in the background) or linearly unstable in the past. 

Let us now turn to the infinite-branch models. There are two candidates with viable 
background histories. The first is the the P 0 P 2 P 4 model. The reality of and cf. 
equation (3.32), suggests that this model is linearly stable. At the background level, this case 
is something of an exception as it is the only bimetric cosmology with exact ACDM evolution: 
the structure of the Friedmann equation and quartic equation conspire to allow the dynamics 
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to be rewritten with a modified gravitational constant and an effective cosmological constant 
[79], 


2 ^ Pi P 2 ^ 0 ^ 4 - 9^1 

Pi — 3^2 Pi — 3^2 

The quartic equation for y can be solved to find 

+ ^0 - 3^2 
V =- - -. 

Pi — 3^2 


(3.36) 


(3.37) 


This implies that all solutions to this model live on the infinite branch. In order for y to 
be real at all times, we are required to have Po - 3 P 2 > 0 and Pi — 3 P 2 > 0. Unfortunately, 
for these reasons we cannot have a viable self-accelerating solution; if Po were set to zero 
(or were much smaller than P 2 and PP, then the effective cosmological constant would be 
negative. The modified gravitational constant would also be negative if Pi were positive. From 
a cosmological point of view, these models are therefore not altogether interesting. 

Finally there is a small class of viable and interesting models which have stable cosmologi¬ 
cal evolution: the self-accelerating PiPi model and its generalisation to include ySo.® Flere, y 
evolves from infinity in the past and asymptotes to a finite de Sitter value in the future. For 
these PoPiPi models we perform a similar eigenfrequency analysis and obtain 




:± 


k + 2 (^o/^i)y + 12y^ + {d + 2PoPi/Pf^y'^ — 2 [Pi/Pi) [4y^ -F3y® — iPi/Pi)y^ 




l-h3y2-2(^4/^i)y= 


(3.38) 


Restricting ourselves to the self-accelerating models (i.e., Po — 0), we obtain 

k 7-1 + 12y2 + 9y* - 2 (P,/Pi) [4y3 + 3y= 

l+3y^-2(ft//3.)y= ' 

Notice that at early times, i.e., for large y, the eigenvalues (3.38) and (3.39) reduce to the 


®We do not have the freedom to include nonzero P 2 or ^ 83 ; in either case the background evolution would 
not be viable [85]. We can see this from the expressions (2.59) and (2.60) for p(y) and H^iy). If ft were nonzero, 
then flm = p would diverge as y at early times. Setting ft = 0, we find flm ^ 1 — 3ft/ft as y ^ 00 . If 
we demand a matter-dominated history, then ft must at the very least be small compared to ft. 
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expression (3.34) for o)p^. This frequency is real, and therefore the Pifis, model, as well as its 
generalisation to include a cosmological constant, is stable on the infinite branch at early 
times. 

Interestingly, the eigenfrequencies for this particular model can also be written as 


k I y" _ k 



(3.40) 


Therefore, the condition for the stability of this model in the infinite branch, where y' < 0, 
is simply y" > 0. One might wonder whether this expression for is general or model 
specific. While it does not hold for the ySa and ySs models, c.f. equations (3.32) and (3.33), it 
is valid for all of the submodels of P 0 P 1 P 4 , including the single-parameter models presented 
in equations (3.28) and (3.34). We can see from this, for example, that the finite-branch 
iy' > 0) ySi model is unstable at early times because initially y” is positive. In figure 3.1 
we show schematically the evolution of the P 1 P 4 model on the finite and infinite branches. 
The stability condition on either branch is y''/y' — dy'/dy < 0. For the parameters plotted, 
P 4 = 2 Pi, one can see graphically that this condition is met, and hence the model is stable, 
only at late times on the finite branch but for all times on the infinite branch. Our remaining 
task is to extend this to other parameters. 

Let us now prove that the infinite-branch ^ 4^4 model is stable in the subhorizon limit at 
all times as long as the background expansion is viable, which restricts us to the parameter 
range 0< P 4 < 2ySi [85]. From equation (3.39) we can see that the subhorizon perturbations 
are clearly stable if and only if 


-l + 12y2 + 9y4_2(yg4/ygi) [4y^+3y^ - {^ 4 /Pi)y^] >0 (3.41) 


At early times, y —> 00 , this is dominated by a manifestly positive term. Indeed we have already 
seen that the eigenfrequencies match those in the P 4 model (3.34) which are purely real. At 
later times, equation (3.41) is satisfied for all y > 1 as long as we restrict to the viable parameter 
range, 0 < P 4 /P 1 < 2. We can therefore rephrase the question of stability as a question about 
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y 

Figure 3.1: Plot of the function y'[y) for the ^ 1^4 model for P 4 — 2y3i. For both the finite 
and infinite branches, the final state is the de Sitter point. The arrows show the direction of 
movement of r. 


the background evolution: do the infinite-branch models in this region of the parameter 
space always have y > 1? 

The answer is yes. Recall from section 2.1.3 that, on the infinite branch, y evolves mono- 
tonically from y = 00 to y = yc, where yc is defined by equation (2.64), 


Ay'-3Ay'+A = o. ( 3 . 42 ) 

Because the evolution of y is monotonic, y > 1 at all times if y^ > 1. Moreover, because y = yc 
corresponds, through the quartic equation (2.58) , to p = 0, we are only interested in the largest 
real root of equation (3.42). For the largest allowed value of P 4 , P 4 — 2y3i exactly, we find yc = 1. 
We must then ask whether for 0 < ^4 < 2ySi, yc remains greater than 1. Writing p = yc - 1, 
using Descartes’ rule of signs, and restricting ourselves to 0 < yS 4 < 2y3i, we can see that p 
has one positive root, i.e., there is always exactly one solution with yc > 1 in that parameter 
range. Therefore, in all infinite-branch solutions with 0 < yS 4 < 2ySi, y evolves to some yc > 1 
in the asymptotic future. We conclude that all of the infinite-branch P 1 P 4 cosmologies which 
are viable at the background level are also linearly stable at all times in the subhorizon limit. 
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providing a clear example of a bimetric cosmology which is a viable competitor to ACDM. 
This stability has been confirmed and extended to the superhorizon limit in a complementary 
analysis in Ref. [138]. 

As a side remark, we note that in this model the asymptotic past corresponds to the limit 
y ^ 00 and y' —> - |y, i.e., y ^ This implies that Y ~ i.e., the second metric 

initially collapses while “our” metric expands. On the approach to the final de Sitter stage, 
y approaches a constant yc, so the scale factors a and Y both expand exponentially. This 
infinite-branch model therefore contains a bouncing cosmology for the / metric. 

This bounce has an unusual consequence. Recall from equation (2.57) that, after imposing 
the Bianchi identity, we have foo — - Therefore, when y bounces, foo becomes zero: 

at that one point, the lapse function of the / metric vanishes, Nevertheless, this does not 
render the solution unphysical, for the following reasons. First, the / metric does not couple 
to matter and so, unlike the g metric, it does not have a geometric interpretation. A singularity 
in therefore does not necessarily imply a singularity in observable quantities. In fact, we 
find no singularity in any of our background or perturbed variables. Second, although the 
Riemann tensor for the / metric is singular when foo — 0, the Lagrangian density 
remains finite and nonzero at all times, so the equations of motion can be derived at any 
points in time. 


3.3 Summary of Results 

In this chapter, we introduced the tools for perturbation theory in massive bigravity and used 
them to test the stability of the theory. 

We began by presenting the cosmological perturbation equations; these are derived in 
appendix A. We went on to detail the way in which the physical degrees of freedom are 
counted and described how to pick a good gauge and integrate out nondynamical variables. 
By doing so we reduced the ten linearised Einstein and fluid conservation equations to a 

^“Moreover, the square root of this, V/J^, appears in the mass terms. We choose branches of the square root 
such that this quantity starts off negative at early times and then becomes positive. 
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system of two coupled, second-order differential equations. These describe the evolution of 
the two independent, dynamical degrees of freedom present at linear order around FLRW 
solutions in massive bigravity. 

We then identified the stable and unstable models by employing a WKB approximation 
and calculating the eigenfrequencies of the perturbation equations. This analysis revealed 
that many models with viable background cosmologies exhibit an instability on small scales 
until fairly recently in cosmic history. However, we also found a class of viable models which 
are stable at all times. These are defined by giving nonzero, positive values to the interaction 
parameters ySi and ^ 4 , setting P 2 — P'i — 0, and choosing solutions in which the ratio y — Y/a 
of the two scale factors decreases from infinity to a finite late-time value. A cosmological 
constant can be added without spoiling the stability, although it is not necessary; the theory 
is able to self-accelerate. 

On the surface, these results would seem to place in jeopardy a large swath of bigravity’s 
parameter space, such as the “minimal” ySi-only model which is the only single-parameter 
model that is viable at the background level [85]. It is important to emphasise that the exis¬ 
tence of such an instability does not automatically rule these models out. It merely impedes 
our ability to use linear theory on deep subhorizon scales (recall that the instability is prob¬ 
lematic specifically for large k). Models that are not linearly stable can still be realistic if only 
the grawtational potentials become nonlinear, or even if the matter fluctuations also become 
nonlinear but in such a way that their properties do not contradict observations. The theory 
can be saved if, for instance, the instability is softened or vanishes entirely when nonlinear 
effects are taken into account. We might even expect such behaviour: bigravity models exhibit 
a Vainshtein mechanism [54, 55] which restores general relativity in environments where the 
new degrees of freedom are highly nonlinear. Consequently two very important questions 
remain: can these unstable models still accurately describe the real Universe, and if so, how 
can we perform calculations for structure formation? 

Until these questions are answered, the infinite-branch ^ 1^4 model seems to be the most 
promising target at the moment for studying massive bigravity. In the next chapter, we will 
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calculate its predictions for structure formation, confront them with data, and discuss the 
potential of near-future probes like Euclid to test this model against ACDM. 



The wonder is, not that the field of the stars is so vast, but that man has measured it. 


Anatole France, The Garden of Epicurus 

Linear Structure Growth in Massive 

Bigravity 



We have, to this point, reviewed the background FLRW solutions of massive bigravity in 
chapter 2 and begun to analyse linear perturbations around these solutions in chapter 3. In 
addition to introducing the formalism for cosmological perturbation theory in bigravity, the 
specific aim of the previous chapter was to identify which models are stable at the linear level 
and which are not. The natural next step is to use perturbation theory to derive observable 
predictions for the stable models. 

In this chapter we undertake a study of the cosmological large-scale structure (LSS) in 
massive bigravity with the aim of understanding the ways in which bigravity deviates from 
general relativity and its potentially-testable cosmological signatures. This is motivated in 
particular by anticipation of the forthcoming Euclid mission which is expected to improve the 
accuracy of the present large-scale structure data by nearly an order of magnitude [87, 88] . 

At the background level, careful statistical analyses show that several bimetric models can 
provide as good a fit as the standard ACDM model, including one case, the ySi-only model, 
which has the same number of free parameters as ACDM [80, 85, 141].^ This is a blessing 

Ws discussed in chapter 3, this model is linearly unstable. However, if the instability is cured at higher orders 
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and a curse; while it is encouraging that massive bigravity can produce realistic cosmologies 
in the absence of a cosmological constant, the quality of background observations is not 
good enough to distinguish these models from ACDM or from each other. In particular, the 
parameter constraints obtained from the expansion history have strong degeneracies within 
the theory itself. 

To efficiently test the theory and distinguish its cosmology from others one needs to 
move beyond the FLRW metric and study how consistent bimetric models are with the 
observed growth of cosmic structure. We restrict ourselves to the linear, subhorizon regime 
and examine whether there are any deviations from the standard model predictions which 
may be observable by future LSS experiments. 

Note that due to the aforementioned instability, some (though not all) bimetric models 
cannot be treated with linear perturbation theory at all times, as any individual mode would 
quickly grow large during the period of instability (from early times until some recent redshift). 
Structure formation must be studied nonlinearly in these cases. In order to demonstrate 
our methodology, we choose to study every model with a sensible background evolution 
and one or two free yS, parameters, so some unstable cases will be included. For the most 
part these should be seen as toy models, useful for illustrative purposes, although our results 
about quantities which do not depend on initial conditions, such as the anisotropic stress, 
will be observationally relevant at later times. One specific model which we study, the infinite- 
branch P 1 P 4 model, is stable at all times, and so we will pay extra attention to its comparison 
to observations. 

This chapter is organised as follows. Taking the subhorizon and quasistatic limit of the 
full perturbation equations, we arrive at a convenient closed-form evolution equation in 
section 4.1 that captures the modifications to the general relativistic growth rate of linear 
structure, as well as the leading-order scale-dependence which modifies the shape of the 
spectrum at near-horizon scales. The coefficients in the closed-form equation are given 
in appendix B. The results are analysed numerically in section 4.2, where we discuss the 


in perturbation theory before the background solution is spoiled, then at the background level this is a perfectly 
viahle model. 
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general features of the models and confront them with the observational data. We conclude 
in section 4.3. 


4.1 Perturbations in the Subhorizon Limit 


We define the perturbed metrics in conformal time (iV = a) as 


^—{1 + Eg)dz^ +2diFgdTdx' + [(1 +Ag)5ij +didjBg^ dx^dx^^, (4.1) 


dsj = -X\l + Ef)dz^ + 2XYdiFfdxdx^+ Y^ [il+Af)5ij YdidjEf] dx'dx^, (4.2) 

and immediately specialise to dust iP = 5P — 0) and work in Fourier space. The linearised 
Einstein and fluid conservation equations have been presented in section 3.1 and are derived 
in appendix A. These equations are quite complicated; in order to isolate the physics of 
interest, namely that of linear structure in the subhorizon regime, we focus on the subhorizon, 
quasistatic limit of the field equations. This limit is defined by taking ~ $ for 

any variable where we have expanded in Fourier modes with wavenumber k , and assuming 
K ^ F[ so we can take the same limit in the /-metric equations. Moreover, we will take Agj 
and Egj to be of the same order as kFgj and k^Bgj, as these terms all appear this way in 
the linearised metric. Finally, we will work in Newtonian gauge for the g metric, defined 
by setting Fg — Bg — 0. Since we are working in the singly-coupled theory where is the 
“physical” metric, i.e., only g^y couples (minimally) to matter, this is as sensible a gauge choice 
as it is in GR.^ 

With these definitions, we can write down the subhorizon evolution equations. The energy 
constraint (the 0-0 Einstein equation) for the g metric is 



(4.3) 


^Given two separate diffeomorphisms for the g and / metrics, only the diagonal subgroup of the two preserves 
the mass term. In practice, this means that we have a single coordinate system which we may transform by 
infinitesimal diffeomorphisms, exactly as in GR. 
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The trace of the i-j equation yields 



(4.4) 


The off-diagonal piece of the i-j equation tells us 


Ag + Eg + m^a^yQBf — 0 . 


(4.5) 


We have not presented the momentum constraint (the 0-i Einstein equation) as it has already 
been used, along with the momentum conservation equation (4.10), to simplify the trace 
i-j equation (4.4).^ Note also that we have used the off-diagonal piece, equation (4.5), to 
eliminate some redundant terms in the trace equation. For the / metric, the corresponding 
equations are 



-K+ H + 


Finally, due to the minimal coupling between matter and gfj^y , the fluid conservation equations 
are unchanged from GR, 


5 + 0 = 0 , 


(4.9) 


e+He--k^Eg^o. 

2 ® 


(4.10) 


In GR the trace equation, (4.4), adds no new information: it becomes an identity after 
using the Friedmann equation. In massive bigravity this equation does carry information and 
it is crucial that we use it. However, we can still simplify it by using the background equations, 

^In the approach of Ref. [142], where this limit is taken hy dropping all derivative terms, this step is crucial for 
the results to be consistent; in our case it is simply useful for rewriting derivatives of Tg and Ag in terms of Eg, 
so that the equation is manifestly algebraic in the perturbations. 
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obtaining^ 

m^^P^xEf — yEg^+2yQAA^—0. (4.11) 

Note that the m —> 0 limit yields an identity, as expected. There is a further interesting 
feature: if we substitute the background equations into the /-metric trace equation, (4.7), we 
obtain equation (4.11) again. Hence one of the two trace equations is redundant and can be 
discarded, so what looks like a system of six equations is actually a system of five. 

With these relations, the system of equations presented in this section is closed. 


4.2 Structure Growth and Cosmological Observables 

In this section we study the linearised growth of structure in the quasistatic and subhorizon 
limit, first solving the field equations to obtain predictions and then comparing to data. 
Deviations from the predictions of general relativity can be summarised by a few parameters 
which are observable by large-scale structure surveys such as Euclid [87, 88]. The main aim of 
this section is to see under what circumstances these parameters are modified by observable 
amounts in the linear regime by massive bigravity. 


4.2.1 Modified Gravity Parameters 

We will focus on three modified growth parameters, defined in the Euclid Theory Working 
Group review [88] as / (and its parametrisation y), Q, and rj. They are: 


Growth rate (/) and index (y): These parameters measure the growth of structure, and are 
defined by 

dlovd 

(4.12) 


d\og5 
f{a,k]=— - 


dloga 

where = a^p/{3M^^EP] is the usual matter density parameter. 


Modification of Newton’s Constant (Q): The function Q{a,k)^ parametrises modifications 


^This equation holds beyond the subhorizon limit, in a particular gauge; see appendix A. 

®Not to be confused with the background quantity defined in equation (3.9), Q = Pi + [x + y] P 2 + xy 
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to Newton’s constant in the Poisson equation, 


F _Q{a,k]p 

9 — Ti /f-9 ^ * 


(4.13) 


Anisotropic Stress (q): Effective anisotropic stress leads the quantity Ag + Eg to deviate from 
its GR value of zero, which we can parametrise by the parameter rjia, k], 


As 

r]{a,k] = -—. 

Eg 


(4.14) 


In GR, these parameters have the values y ^ 0.545 and Q — rj — l. 

We have five independent Einstein equations [equations (4.3), (4.5), (4.6), (4.8), and (4.11)] 
for five metric perturbations® and 5. Crucially, this system is algebraic. There are five equa¬ 
tions for six variables, so we can only solve for any five of the perturbations in terms of the 
sixth. Of the modified growth parameters, Q and rj are ratios of perturbations so are insensi¬ 
tive to how we solve the system. However, to find y we need to solve a differential equation 
for 5. It is therefore simplest to solve for the perturbations {Agj, Egj, Bf] in terms of 5. 

Solving the system, we find each perturbation can be written in the form /(t, k)5, for 
some function f[t). We do not display the solutions here as they are quite unwieldy, although 
we do note that in the limit with only ydi / 0 studied in Ref. [142], and taking into account 
differences in notation and gauge, our expressions for the perturbations match theirs. 

With these solutions for {Agj, Egj, Bf} in hand, we can immediately read off Q and rj. 
To calculate the growth index, y, we need to solve a conservation equation for the density 
contrast, 5. The fluid conservation equations, (4.9) and (4.10), are unchanged from GR, so 
as in GR we can manipulate them to find the usual evolution equation for 5 sourced by the 
gravitational potential, 

5 + H5 + ^k^Egi5)^0. (4.15) 

®After gauge fixing there are six metric perturbations, but once we substitute the 0-/ equations into the trace 
i-j equations, Ff drops out of our system. In a gauge where Pg — Ff — 0, as was used in Ref. [142], the equivalent 
statement is that the Bg and Bf parameters are only determined up to their difference, Bf — Bg, which is gauge 
invariant. 
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At this point we diverge from the usual story. In GR, there is no anisotropic stress and the 
Poisson equation holds; combining the two, we find k^Eg — —[a^p/M'^^)5. Both of these 
facts are changed in massive higravity, so there is a modified (and rather more complicated) 
relation between k^Eg and 5. However, since we do have such a relation, 5 still obeys a closed 
second-order equation which we can solve numerically. 

Finally, we note that the three modified gravity parameters are encapsulated by five 
time-dependent parameters. The expressions for rj and Q can be written in the forms 



(4.16) 


(4.17) 


where the hi are functions of time only and depend on m^^i. We present their explicit forms 
in appendix B. The same result has been obtained for Horndeski gravity [143, 144], which 
is the most general scalar-tensor theory with second-order equations of motion [145]. The 
similarity is a consequence of the fact that massive bigravity introduces only a single new 
spin-0 degree of freedom, its equations of motion are second-order, and the new mass scale it 
introduces (the graviton mass) is comparable to the Hubble scale [142, 146]. 

Furthermore, the structure growth equation, (4.15), can be written in terms of Q and rj 
and hence the coefficients as 


5 + H5-- 


2 n M2 
' g 


1 Q a^p 


(4.18) 


The quantity Q/rj, sometimes called Y in the literature [142, 144,147], represents deviations 
from Newton’s constant in structure growth, and is effectively given in the subhorizon regime 
by(fiifi5)/(fi2fi3)- 
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4.2.2 Numerical Solutions 

In this section we numerically solve for the background quantities and modified gravity 
parameters for one- and two-parameter bigravity models/ We look in particular for potential 
observable signatures, as the growth data are currently not competitive with background 
data for these theories, although we expect future LSS experiments such as Euclid [87, 88] to 
change this. The recipe is straightforward: using equation (2.61) we can solve directly for y(z), 
which is all we need to find solutions for ri[z, k] and Q(z, k) using equations (4.16) cind (4.17). 
Finally these can be used, along with equations (2.59) and (2.60), to solve equation (4.18) 
numerically for 5(z, k) and hence for f[z, k). We fit f[z, k) to the parametrisation in the 
redshift range 0 < z < 5 unless stated otherwise. 

The likelihoods for these models were analysed in detail in Ref. [80], using the Union2.1 
compilation of Type la supernovae [148], Wilkinson Microwave Anisotropy Probe (WMAP) 
seven-year observations of the CMB [149], and baryon acoustic oscillation (BAO) measure¬ 
ments from the galaxy surveys 2dFGRS, 6dFGS, SDSS and WiggleZ [150-152]. We compute 
likelihoods based on growth data compiled in Ref. [153], including growth histories from the 
6dFGS [154], LRG 200 , LRGeo [155], BOSS [156], WiggleZ [157], and VIPERS [158] surveys. 

Both the numerical solutions of background quantities and the likelihood computations 
are performed as in Ref. [80], where they are described in detail. Following Ref. [80], we will 
normalise the yS,- parameters to present-day Hubble rate, Hq, by defining 

Bi = —Pi. (4.19) 

^0 

Throughout, we will assume that the g-metric cosmological constant, Bq, vanishes, as we are 
interested in the solutions which accelerate due to modified-gravity effects. 


^We focus on these simpler models to illustrate bigravity effects on growth. Current growth data are not able 
to significantly constrain these models, so we would not gain anything by adding more free parameters. 
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The Minimal Model 

We begin with the “minimal” model in which only Bi is nonzero. This is the only single-5; 
theory which is in agreement with background observations [80]; the other models also have 
theoretical viability issues [85]. Note, however, that the linear perturbations are unstable 
at early times until relatively recently, z ~ 0.5, as discussed in chapter 3. This restricts the 
real-world applicability of the results presented herein, as the quasistatic approximation we 
employ will not be viable. Our results will hold for observations within the stable period. 
Specifically, our results for Q and rj will certainly hold, while the growth rate, /, may vary if 
the initial conditions for 5 are significantly cheinged from what we assume herein. Otherwise 
this should be seen as an illustrative example. 

The likelihoods for Bi are plotted in figure 4.1 based on supernovae, BAO/CMB, growth 
data, and all three combined, although the growth likelihood is so wide that it has a negligible 
effect on the combined likelihood. The point was raised in Ref. [142] that the WMAP analysis 
is performed assuming a ACDM model and hence may not apply perfectly to these data. 
We will take an agnostic point of view on this and consider both the best-fit value of Bi 
from supernovae alone (Bi = 1.3527 ± 0.0497) and from the combination of supernovae and 
CMB/BAO (Bi = 1.448 ± 0.0168).® The results do not change qualitatively with either choice. 

The growth rate, /, at A: = 0.1 h/Mpc is plotted in the first panel of figure 4.2, along with 
the parametrisation with best fits y — 0.46 for Bi = 1.35 and y — 0.48 for Bi = 1.45. This is 
in agreement with the results of Ref. [142], who additionally found that f{z] is fit much more 
closely by a redshift-dependent parametrisation, + ^ j. In the second panel we 

plot the best-fit value of y as a function of Bi. All values of Bi consistent with background 
observations give a value of y that is far from the GR value (including ACDM and minimally- 
coupled quintessence models) of y 0.545. While present observations of LSS are unable 
to easily distinguish this model from ACDM (cf. figure 4.1), the Euclid satellite expects to 
measure y within 0.02 [87, 88] and should easily be able to rule out either the minimal massive 

®These differ slightly from the best-ht Bi — 1.38 ± 0.03 reported by Ref. [85], also based on tbe Union2.1 
supernovae compilation. 
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Figure 4.1: The likelihood for Bi in the Bi-only model from growth data (red), as well as back¬ 
ground likelihoods for comparison. The fits for Bi effectively depend only on the background 
data; the combined likelihood (black) is not noticeably changed by the addition of the growth 
data. 

bigravity model or GR. Note that there is a caveat, in that we have calculated y by fitting over 
a redshift range (0 < z < 5) which includes the unstable period of this model’s history (z > 0.5) 
during which linear theory breaks down. As emphasised above, these predictions should only 
be compared to data during the stable period. Therefore if this model does describe reality, 
Euclid may measure a different growth rate at higher redshifts; a nonlinear analysis is required 
to answer this with certainty. 

We next look at the modified gravity parameters ri[z, k] and Q(z, k). In figures 4.3 and 4.4 
they are plotted with respect to z, Bi, and k, respectively, wdth the other two quantities fixed. 
Q deviates from the GR value Q = 1 by ~ 0.05, while rj deviates from GR by up to ~ 0.15. From 
the first panel of figure 4.3 we notice that Q and rj lose their dependence on Bi momentarily 
around z ~ 2.5. This feature persists to other values of Bi as well. Additionally, we can see 
from the third panel that Q and rj only depend extremely weakly on k in the linear subhorizon 
regime. Future structure experiments like Euclid will be able to constrain Q and rj more tightly 
in a model-independent way because they are effectively scale-independent; in particular. 
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k = 0.1 h/Mpc 



1.1 1.2 1.3 1.4 1.5 1.6 1.7 


Figure 4.2: First panel: The growth rate, / = d\n5/d\na, for the SNe hest-fit parameter, 
Bi — 1.35 (in hlack), and for the SNe/BAO/CMB combined hest-fit parameter, Bi — 1.45 (in 
red). The full growth rate (solid line) is plotted alongside the W parametrisation (dotted line) 
with best fits j — 0-46 and 0.48 for Bi = 1.35 and 1.45, respectively. Second panel: The best-fit 
value of y as a function of 6i. For comparison, the GR prediction (y 0.545) is plotted as a 
black horizontal line. The blue lines correspond to the best-fit values of Bi from different 
background data sets. This is a prediction of a clear deviation from GR. 
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because of scale independence they are expected to be able to measure rj within 10% [147], 
which would bring this minimal model to the cusp of observability. 

Two-Parameter Models 

At the background level, four models with two nonzero B, parameters provide good fits to 
the data: 8 ^ 82 , 8 ^ 83 , 8184 , and 8283 [80]. Even though these models all possess a two- 
dimensional parameter space, only an effectively one-dimensional subspace matches the 
background data (cf. figures 7 and 8 of Ref. [80] and figure 4 of Ref. [85]). We will restrict 
ourselves to those subspaces by fixing one B, parameter in terms of another, usually Bi. We 
do this by identifying the effective present-day dark energy density, from the Friedmann 

equation (2.51): 

= Biyo + 82 yQ + - Bay^, (4.20) 

and plugging that into the quartic equation for y (2.58), evaluated at the present day (y = yo) 
and using ^ 1^,0 + = 1- This procedure fixes one B/ parameter in terms of the other and 

The value of can be determined by fitting to the data, as was done in Ref. [80] . However, 
for finite-branch solutions of the Bi B; models we can also simply take the limit in which only 
Bi is nonzero, recovering the single-parameter model discussed above; by then using the 
SNe/CMB/BAO combined best-fit value Bi = 1.448 we find = 0.699. 

A detailed study of the conditions for a viable background was undertaken in Ref. [85]. 
There are two results that are particularly relevant for the present study and bear mentioning. 
First, a viable model requires Bi > 0. Second, the two-parameter models are all (with one ex¬ 
ception, which we discuss below) finite-branch models, in which, as discussed in section 2.1.3, 
y evolves from 0 at z = 00 to a finite value yc at late times, which can be determined from 
equation (2.59) by setting p = 0 at y = yc. Consequently, the present-day value of yo, which 
is generally simple to calculate, must always be smaller than yc (or above yc in a model with 

®This does not need to coincide with the value of JIa derived in the context of ACDM models. For the Bi-only 
model and hence all the two-parameter finite-hranch models, they happen to be similar in value, although this 
was not a priori guaranteed, while in the infinite-branch Bi B 4 model, the best-fit value to the background data 
isnff=0.84l[;;“ [85], 
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k = 0.1 h/Mpc 



Figure 4.3: The modified gravity parameters Q (modification of Newton’s constant) and rj 
(anisotropic stress) in the 5i-only model as functions of z and B^. They exhibit ^(10“^)- 
^(10“i) deviations from the GR prediction, which will be around the reinge of observability of 
a Euclid-like mission. 
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B, = 1.45 



k [h/Mpc] 

Figure 4.4: The modified gravity parameters Q (modification of Newton’s constant) and rj 
(anisotropic stress) in the Bi-only model as functions of scale. They depend only very 
weakly on scale; consequently, more stringent constraints can be placed on them in a model- 
independent way by future surveys. 

an infinite branch). In the Bi B 3 and Bi B 4 models this will rule out certain regions of the 
parameter space a priori. 

There is one two-parameter model which was shown in Ref. [80] to fit the background data 
well but was ruled out on theoretical grounds in Ref. [85]: the B 2 B 3 model. The theoretical 
issue is that y becomes negative in finite time going towards the past. This itself may not 
render the model observationally unviable, as long as any issues occur outside of the redshift 
range of observations. However, we have solved equation (2.61) numerically for y(z), and 
found that y generically goes to -00 at finite z, which means that at higher redshifts there 
is not a sensible background cosmology at all. This problem can be avoided by introducing 
new physics at those higher redshifts to modify the evolution of y, or by increasing B 2 enough 
that the pole in y occurs at an unobservably high redshift.However, these are nonminimal 
solutions, and so we do not study the B 2 B 3 model. 

^°For B 2 = (5,50,500), and B 3 chosen to give an effective 0.7 today, the pole occurs at z 

(1.99,8.19,27.95). 
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Recall from chapter 3 that all of the two-parameter models except for the infinite-branch 
B 1 B 4 model suffer from an early-time instability. Consequently, caution should be used when 
applying the results for any of the other models in this section to real-world data. We empha¬ 
sise again that our quasistatic approximation is only valid at low redshifts, and that moreover 
the growth rate should be recalculated using whatever initial conditions the earlier period 
ends with. (The predictions for Q and rj do not depend on solving any differential equation 
and therefore apply without change.) Modulo this caveat, we present the quasistatic results 
for the unstable models as proofs of concept, as examples of how to apply our methods. The 
infinite-branch Bi B 4 predictions, presented at the end of this section, can be straightforwardly 
applied to data. 

We evaluate all quantities at A: = 0.1 fi/Mpc. The modified gravity parameters in all of the 
two-parameter models we study depend extremely weakly on k, as in the Bi-only model. 

As we have already mentioned, in these models the two Bi parameters are highly de¬ 
generate at the background level. One of the main goals of this section is to see whether 
observations of LSS have the potential to break this degeneracy. 


B 1 B 2 : The Bi B 2 models which fit the background data [80] live in the parameter subspace 


S2 = 


-Bl + - /Bf+9^^^ 


(4.21) 


with 0.7. This line has a slight thickness because we must rely on observations to fit 
We can subsequently determine yo from equation (4.20). 

This model possesses an instability when B 2 < 0.^^ This is not entirely unexpected: the 
B 2 term is the coefficient of the quadratic interaction, and so a negative B 2 might lead to a 
tachyonic instability. However, the instability of the Bi B 2 model is somewhat unusual: in 
the subhorizon limit, Q and rj develop poles, but they only diverge during a brief period 
around a fixed redshift, as shown in the first panel of figure 4.5, regardless of wavenumber 

^ W similar singular evolution of linear perturbations in a smooth background has been observed in the 
cosmology of Gauss-Bonnet gravity [159, 160]. This instability is different from the early-time instabilities 
discussed in chapter 3, as those do not arise in the quasistatic limit which we are now taking. 
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or initial conditions. We can find these poles using the expressions in appendix B. The exact 
solutions are unwieldy and not enlightening, but there are three notable features. First, as 
mentioned, the instability only occurs for 62 < 0. (When B 2 > 0, these poles occur when y is 
negative, which is not physical.) Second, the instability develops at high redshifts, z>2. The 
redshift of the latest pole (which can be solved for by taking the limit k/H 00 ) is plotted as a 
function of Bi in the second panel of figure 4.5. As a result, measurements of Q and ry at z < 2 
would generally not see divergent values. However, such measurements would see the main 
instabilities at much lower redshifts. Finally, the most recent pole occurs at y = 0 for 62 = 0 
(Bi = y1.45), and approaches y =yo/2 as B 2 —> -00 (Bi —> 00 ). 


This particular instability is avoided if we restrict ourselves to the range 0 < Bi < 1.448, for 
which B 2 > 0. Some typical results for this region of parameter space are plotted in figures 4.6 
and 4.7. The first panel of figure 4.6 plots /(z) and the best-fit parametrisation for selected 
values of Bi [with B 2 given by equation (4.21)], while the second panel shows the best-fit value 
of y as a function of B^. For smaller values of Bi this parametrisation fits / well, more so than 
in the Bi-only model discussed in section 4.2.2 (which is the Bi = 1.448 limit of this model). 
We find that y is always well below the GR value of y 0.545, especially at low B^. 


In figure 4.7 we plot Q and rj, both in terms of Bi at fixed z and in terms of z at fixed Bi. In 
comparison to the Bi-only model (at the far-right edge in the first panel), lowering Bi tends to 
make these parameters more GR-like, except for rj evaluated at late times (z ~ 0.5), which dips 
as low as J 7 ~ 0.6. Because these quantities are all A:-independent in the linear, subhorizon 
regime, future LSS experiments like Euclid would be able to measure rj at these redshifts 
to within about 10% [147] and thus effectively distinguish between ACDM and significant 
portions of the parameter space of the B 1 B 2 model, testing the theory and breaking the 
background-level degeneracy. 
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Figure 4.5: First panel: Q and rj for a few parameter values in the instability range of the 
Bi B 2 model. Generally there are two poles, each of short duration. Note that the redshift at 
which the pole occurs depends only on 61 and 62 and not on the initial conditions for the 
perturbations. Second panel: The redshift of the most recent pole in Q (the pole in rj occurs 
nearly simultaneously) in terms of Bi > 1.448. This parameter range corresponds to Bz < 0, cf. 
equation (4.21). The minimum is at (5i, B 2 ,z) = (3.11, —2.51,2.24). 
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k = 0.1 h/Mpc 



Figure 4.6: Growth-rate results for the B 1 B 2 model, with = 0.699 and Bi < 1.448. The 
significant deviation from GR in j should be observable by a Euclid-like experiment. Moreover, 
it has the potential to break the degeneracy between Bi and 62 when fitting to background 
observations. 
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k = 0.1h/Mpc 
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Figure 4.7: The modified gravity parameters Q and rj for the B 1 B 2 model, with = 0.699 
and Bi < 1.448. As with the growth rate in figure 4.6, we notice significant deviations from GR. 
The anisotropic stress, r]{z ~ 0.5), is a particularly good target for observations. 
























106 


Chapter 4. Linear Structure Growth in Massive Bigravity 


B 1 B 3 : The B 1 B 3 models which are consistent with the background data [80] lie along a 
one-dimensional parameter space (up to a slight thickness) given hy 




32Bl + 8lBinf±^ {8Bl-27nf) [l6Bf + 27nf) 
243(llf)2 


(4.22) 


with 0.7. There is a subtlety here: the physical branch is constructed in a piecewise 
fashion, taking the + root for Bi < {312)^1^ — 1.536 and the - root otherwise [85]. We 

solve for y (z) using the initial condition,derived from equation (2.58), 


To 




(4.23) 


As discussed at the beginning of this section, yo should not be larger than the value of y in the 
far future, yc. Demanding this, we find a maximum allowed value for 63 ,^^ 

B3< ^ (^32^1+8151 + ^(1652 + 27) (8Bf-27)^j . (4.24) 

For = 0.699 this implies we need to restrict ourselves to Bi > 1.055. This sort of bound is 
to be expected: we know that the 63 -only model is a poor fit to the data [80], so we cannot 
continue to get viable cosmologies the entire way through the Bi ^ 0 limit of the Bi B 3 model. 

We plot the results for the B 1 B 3 model in figures 4.8 and 4.9. These display the tendency, 
which we will also see in the B 1 B 4 model, that large ] 6 /] values lead to modified gravity 
parameters that are closer to GR. For example, y can be as low as y 0.45 for the lowest 
allowed value of Bi, but by Bi ~ 3 it is practically indistinguishable from the GR value, 
assuming a Euclid-like precision of ~ 0.02 on y [87]. Again we note that this value of y has 
been obtained assuming 0 < z < 5, which is not a valid range for observations because of the 
early-time instability. For lower values of Bi, current growth data (see, e.g.. Ref. [153]) are not 

i^There is also a positive root, but this is not physical. When < 0, that root yields yo < 0. When B 3 > 0, 
which is only the case for a small range of parameters, then the positive root of yo is greater than the far-future 
value yc and hence is also not physical. 

i^Note, per equation (4.22), that this is equivalent to simply imposing < 1, which must be true since we 
have chosen a spatially-flat universe a priori. 
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sufficient to significantly constrain the parameter space, but these non-GR values of y and rj 
should be well within Euclid’s window. 


B 1 B 4 : This model comprises the lowest-order Bi term in conjunction with a cosmological 
constant for the / metric, Note that B 4 does not contribute directly to the Friedmann 
equation (2.51), but only affects the dynamics through its effect on the evolution of y. 

The B 1 B 4 model has two viable solutions for y(z): a finite branch with 0 < y < yc, and 
an infinite branch with y^ < y < 00 . The infinite-branch model is the only two-parameter 
bimetric model which is linearly stable at all times, as shown in chapter 3. Therefore this 
should be considered the most viable bimetric massive gravity theory to date. In this section 
we will elucidate its predictions for subhorizon structure formation. 

As discussed in the beginning of this section, yc is the value of y in the asymptotic future 
and can be calculated by setting p = 0 in equation (2.59). We will consider the two branches 
separately. 

For a given B 4 is related to Bi by 


64 = 


- Bf 


(nfr 


(4.25) 


while yo is given by 


neff 

yo = -^. (4.26) 

Bi 

Background viability conditions impose Bi > 0 for both branches and B 4 > 0 on the infinite 
branch [85]. The late-time value of y, y^ is determined by 


B4y^"-3Biy2 + Bi = 0, (4.27) 

from which we can determine that real, positive solutions for y^ only exist if B 4 < 2Bi. Com¬ 
bined with equation (4.25) and the requirements that Bi, > 0, we find two possible regions 

’^In the singly-coupled version of massive bigravity we are studying, matter loops only contribute to the 
g-metric cosmological constant, Bg. 
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k = 0.1 h/Mpc 



Figure 4.8: Growth-rate results for the B 1 B 3 model, with = 0.699 and Bi > 1.055. While Bi 
and 63 are degenerate in the background (along a line given by equation (4.22)), perturbations 
clearly can break this degeneracy, with significant deviations at low values of Bi (small IB 3 I), 
and GR-like behaviour at large values of Bi (large, negative B3). 
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k = 0.1h/IVIpc 



k = 0.1 h/Mpc 
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Figure 4.9: The modified gravity parameters Q and rj for the Bi model, with = 0.699 and 
Bi > 1.055. As with the growth rate, we notice deviations from GR for small Bi (small IR 3 I), 
with parameters approaching their GR values for large Bi (large, negative B 3 ). 
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for Bi, as can be seen from the example plotted in the first panel of figure 4.10. We can 
identify each of these two regions with the two solution branches by comparing the solutions 
of equation (4.27) for yc to yo = Restricting to positive, real roots of yc one can see 

(graphically, for example) that the first region, with the smaller values of Bi, has yo > yc for 
all roots of yc, and hence can only comprise infinite-branch solutions, while yo < yc in the 
second region, as is plotted in the second panel of figure 4.10. This identification is supported 
by observational data [85] and also makes intuitive sense. Consider the limit 64 = 0. In the 
second region this has Bi > 0 and corresponds to the Bi-only model, a finite-branch model, 
which we discussed in section 4.2.2. In the first region the point B 4 = 0 coincides with Bi — 0, 
which is simply a CDM model with no modification to gravity, in agreement with the fact that 
there should not be an infinite-branch Bi-only model. 

These considerations place constreiints on the allowed ranges of Bi, as in the B 1 B 3 model, 
which depend on the best-fit value of 11^^. The Bi-only model (64 = 0, 61 > 0) is on the 
finite branch, so that on that branch we can use = 0.699 as we did in the other models. 
This implies Bi > 1.244 for the finite branch. On the infinite branch, SNe observations are 
best fit by = 0.84 [85]; consequently we restrict ourselves to Bi < 0.529 for the infinite 
branch. Note that the infinite-branch model therefore predicts an unusually low matter 
density, flm,o ^ 0.18. 

We plot the results for the finite branch in figures 4.11 and 4.12. Qualitatively, this model 
predicts subhorizon behaviour quite similar to that of the 6163 model, discussed above and 
plotted in figures 4.8 and 4.9. 

Now let us move on to the stable infinite-branch model, the modified gravity parameters 
of which are plotted in figures 4.13 and 4.14. This is the only model we study which does 
not possess a limit to the minimal Bi-only model, and it predicts significant deviations from 
ACDM. In this model, rj deviates from 1 by nearly a factor of 2 at all observable epochs and 
for all allowed values of Bi, providing a clear observable signal of modified gravity. This is a 
significant feature of this model; it has no free parameters which can be tuned to make its 
predictions arbitrarily close to ACDM and therefore is unambiguously testable. 
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Figure 4.10: Allowed regions in the B 1 -B 4 parameter space. First panel: Regions in parameter 
space with B 4 < Bi, which is required for the viability of the background. The orange line fixes 
B 4 as a function of Bi for = 0.84, per equation (4.25). The blue line is B 4 = 2Bi. Second 
panel: Plotted are yo (orange line) and the three roots of yc, again with = 0.84. Of the two 
regions with B 4 < Bi, we can see that the region with smaller Bi has yo > yc and therefore 
corresponds to the infinite branch, while the region with larger Bi has yo < yc and therefore 
possesses finite-branch solutions. 
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k = 0.1 h/Mpc 
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k = 0.1 h/Mpc 



Figure 4.11: Growth-rate results for the B 1 B 4 model on the finite branch, with 11^® = 0.699 and 
Bi > 1.244. The behaviour is quite similar to that of the B 1 B 3 model, plotted in figures 4.8 
and 4.9. 
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k = 0.1h/IVIpc 



k = 0.1 h/Mpc 
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Figure 4.12: The modified gravity parameters Q and rj for the 8^84 model on the finite branch. 
Results are displayed for = 0.699 and 81 > 1.244. 
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k = 0.1 h/Mpc 



Figure 4.13: Growth-rate results for the B 1 B 4 model on the infinite branch, with = 0.84 
and Bi < 0.529. This model is qualitatively different from the others we study, as it does not 
possess a limit to the minimal Bi-only model, and has the strongest deviations from ACDM 
among all of the models presented in this work. Flere we plot f[z) as well as the best-fit 
parametrisation with the fitting done over the redshift ranges 0 < z < 1 and 0 < z < 5. 
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k = 0.1h/Mpc 





k = 0.1 h/Mpc 
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Figure 4.14: The modified gravity parameters Q and rj for the 6164 model on the infinite 
branch, with = 0.84 and Bi < 0.529. We find strong deviations from GR which will be 
easily visible to near-future LSS experiments. 
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We can understand this behaviour analytically as follows. In the asymptotic past, we can 
take the limits of our expressions for rj and Q to find 

1 2 

lim rj = - and lim Q — -. (4.28) 

N^—co 2 N—*—oo 3 

Unusually, structure growth does not reduce to that of ACDM even though at early times the 
graviton mass is very small compared to the Hubble scale. In the future one finds p —> 1 if 
k is kept finite, but this is somewhat unphysical: for any finite k there will be an epoch of 
horizon exit in the future after which the subhorizon QS approximation breaks down. We 
can see both the asymptotic past and asymptotic future behaviours in the second panel of 
figure 4.14, although the late-time approach of rj to unity is not entirely visible. 

The growth rate and index, f[z] and y, also deviate strongly from the ACDM predictions. 
Using the Dm parametrisation, we find that y is even lower than the range ~ 0.45-0.5 which 
we found in the Bi-only model and in the other two-parameter models. However, the 
parametrisation is an especially bad fit to f[z) in this case; we fit y to f[z) in the redshift 
range 0 < z < 5 (as in the rest of this chapter) and 0 < z < 1 (which is the redshift range of 
present observations [153]) in figure 4.13, obtaining significantly different results and still 
never agreeing well with the data. 

As shown in figure 4.15, the confidence region obtained from the growth data is in agree¬ 
ment with type la supernovae (SNe) data (see Ref. [85] for the likelihood from the SCP Union 
2.1 Compilation of SNe la data [148]). The growth data alone provide ySi = 0.40+o Jg and 
Pa — 0-67;';o 38 with a xIah — (with 9 degrees of freedom) for the best-fit value and is in 
agreement with the SNe la likelihood. The likelihood from growth data is, however, a much 
weaker constraint them the likelihood from background observations. Thus, the combination 
of both likelihoods, providing Pi = 0.48^o;?6 Pa — 0.94;'(o gj, is similar to the SNe la result 
alone. 

In figure 4.16 we compare the growth rate directly to the observational data compiled in 
Ref. [153], using the best-fit values determined above. The available growth data are unable 
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Figure 4.15: Likelihoods for Bi and 64 . The red, orange, and light-orange filled regions 
correspond to 68 %, 95% and 99.7% confidence levels using growth data alone. The hlack 
( 68 %) and gray (99.7%) regions illustrate the comhination of the likelihoods from measured 
growth data and type la supernovae. The blue line indicates the degeneracy curve given hy 
equation (4.25) with the best-fit value of The viability condition enforces the likelihood 
to vanish when ^4 > 2y3i. 

to distinguish between the infinite-branch model and ACDM. We also find that an alternative 
parametrisation. 



(4.29) 


is able to provide a much better fit to f[z) than the usual rtm. The best-fit values for this 


parametrisation are yo — 0.47 and a = 0.21. 

4.3 Summary of Results 

In this chapter we have examined the evolution of cosmological perturbations on subhorizon 
scales in massive bigravity, describing linear structure formation during the matter-dominated 
era. We solved the linearised Einstein and conservation equations in the quasistatic, sub¬ 
horizon limit in Newtonian gauge for In this limit, we found that the perturbations are 
described by a system of six algebraic equations for five variables. We obtained a consistent 
solution to this system relating each of the metric perturbations appeeiring in the subhorizon 
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Figure 4.16: Growth history for the best-fit infinite-branch P 1 P 4 model (solid blue) with 
Pi = 0.48 and Pi = 0.94 compared to the result obtained from the best fit (4.29) (solid orange) 
with Yo — 0.47 and a = 0.21 and the ACDM predictions for rtmfi = 0.27 (dotted red) and 
= 0.18 (dotted-dashed green). The latter value for the matter density is the same as 
is predicted by the infinite-branch model. Note that a vertical shift of each single curve is 
possible due to the marginalisation over erg. Here we choose erg for each curve individually 
such that it fits the data best. The growth histories are compared to observed data compiled 
by Ref. [153]. 


Einstein equations to the matter density contrast. This allowed us to derive a modified evo¬ 
lution equation for the density contrast, which differs from its GR counterpart by a varying 
effective Newton’s constant. We also obtained algebraic expressions for the anisotropic stress, 
ri[z, k], and the parameter measuring modifications to Newton’s constant in the Poisson equa¬ 
tion, Q(z, k), purely in terms of background quantities. We then solved for the background 
numerically to obtain these parameters, and finally integrated the structure growth equation 
to derive the growth rate, /(z, k], and its best-fit parametrisation, / ~ 

We have studied every subset of the theory which is viable at the background level and 
contains one or two free parameters, excluding the g-metric cosmological constant as we 
are interested in self-accelerating theories. Among the single-parameter models, only the 
case with the lowest-order interaction term, Pi / 0, is in agreement with the background 
data. As emphasised in Refs. [80, 85,141], this “minimal” bigravity model is especially appeal¬ 
ing because it possesses late-time acceleration and fits the background data well with the 
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same number of free parameters as ACDM. We have found that it predicts modified gravity 
parameters that differ significantly from GR: y ~ 0.46-0.48 (in agreement with Ref. [142]), 
Q ~ 0.94-0.95, and iq ~ 0.88-0.90. For reference, the ACDM predictions are y 0.545 and 
Q — rj — 1. Future large-scale structure experiments such as Euclid [87, 88 ] will easily be 
able to distinguish this simple model from GR, if we can trust its predictions. Flowever, we 
have shown in chapter 3 that this model suffers from an early-time instability. If this can be 
overcome, or if observations are restricted to late times (z < 0.5), our results demonstrate that 
by going to the level of linear perturbations, this theory can be probed in the near future by 
multiple observables which deviate significantly from general relativity. 

We additionally examined the four two-parameter models which are viable in the back¬ 
ground, all of which keep ySi > 0 while turning on a second, higher-order interaction term. 
Two of these models, in which either the cubic interaction, ySa, or the /-metric cosmological 
constant, ^ 4 , is nonzero (the latter specifically in the “finite branch,” which reduces to the 
minimal model in the ^4 ^ 0 limit), have similar behaviour to each other. They predict 
GR-like values for all three modified gravity parameters in the limit where m^Pi/H^ is large, 
becoming indistinguishable from AGDM (given a Euclid-like experiment) for > 3. 

These reduce to the predictions of the minimal model in the limit 1.45 (the best-fit 

value for in the minimal model). Eor lower values of (corresponding to positive /a or 
P4) these models predict even more dramatic deviations from GR: y can dip to 0.45, and q 
at recent times can be as low as ~ 0.75. Euclid is expected to measure these parameters to 
within about 0.02 and 0.1, respectively [87, 88 , 147], and therefore has the potential to break 
the degeneracies between and /a or and ^4 which is present at the level of background 
observations. 

The / 1/2 model has an instability when Pz, the coefficient of the quadratic term, is nega¬ 
tive. This instability does not necessarily rule out the theory, but it might signal the breakdown 
of linear perturbation theory, in which case nonlinear studies are required in order to under¬ 
stand the formation of structure. This is different from the early-time instability discussed 
in chapter 3 which does not show up in the subhorizon, quasistatic regime. It is possible 
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that these perturbations at some point become GR-like due to the Vainshtein mechanism. 
Moreover, because the instability occurs at a characteristic redshift (which depends on the yS; 
parameters), there may be an observable excess of cosmic structure around that redshift. 

The parameter range of the P 1 P 2 model over which this instability is absent, 0 < ySi < 
lAbHl/m'^ (corresponding to H^/m^ > yS 2 > 0), is quite small; near the large ySi end the predic¬ 
tions recover those of the minimal ySi-only model, while at the small ySi end the perturbations 
can differ quite significantly from GR, with y as low as ~ 0.35 and rj as low as 0.6. However, 
the exact Pi — 0 limit of this theory is already ruled out by background observations [80], so 
one should take care when comparing the model to observations in the very low ySi region of 
this parameter space. 

Finally, we examined the “infinite branch” of the ^ 1^4 model, which is the only bimetric 
modcF^ that avoids the early-time instabilities uncovered in chapter 3. This is called an 
infinite-branch model because the ratio of the / metric scale factor to the g metric scale 
factor, y, starts at infinity and monotonically decreases to a finite value. In the rest of the 
models we study, y starts at zero and then increases; consequently, in the ^ 4^0 limit this 
theory reduces to pure CDM, rather than to the ySi-only model. 

The predictions of the infinite-branch P 1 P 4 theory deviate strongly from GR. The model 
predicts a growth rate /(z) which is not well-parametrised by a fit, but has best-fit values 
of y on the low side (0.3-0.4, depending on the fitting range). The anisotropic stress rj is 
almost always below 0.7 and can even be as low as 0.5, a factor of two away from the GR 
prediction. Across its entire parameter space, this model has the most significantly non-GR 
values of any we study. Its predictions should be well within Euclid’s window. 

As the only sector of the massive bigravity parameter space which both self-accelerates 
and is always linearly stable around cosmological backgrounds, it is a prime target for obser¬ 
vational study especially because its observables are far from those of ACDM for any choices 
of its parameters. 


to the addition of a cosmological constant, which is uninteresting. 




O, that way madness lies; let me shun that; 
No more of that. 


Lear, King Lear, 3.4 
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The Geometry of Doubly-Coupled Bigravity 


The existence of a consistent bimetric theory raises an intriguing question: which is the 
physical metric? In chapters 3 and 4 we chose to couple only one of the two metric, 
directly to matter, while the other dynamical metric, only interacts with matter fields 
indirectly through its interactions with It is specifically in this singly-coupled context 
that the absence of the Boulware-Deser ghost was originally proven [46-48] , and extending 
that analysis to other matter couplings is decidedly nontrivial [72, 73,161,162], It is therefore 
natural to interpret g^v in this case as the usual “metric” of spacetime, while is an extra 
spin-2 field that is required in order to give mass to the graviton. Since the fields g^v and 
fur have metric properties, we have called both g^^ and f^v “metrics,” even though strictly 
speaking, the singly-coupled theory could more accurately be called a theory of “gravity 
coupled to matter and a symmetric 2-tensor” [75]. 

In this chapter we aim to explore the consequences of coupling matter to both metrics in 
massive bigravity. As discussed in section 2.1.2, the bimetric action (2.30) in vacuum places 
both metrics on equal footing: it is invariant under the interchanges (2.38) 
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This metric-interchange duality is broken by the addition of matter in the singly-coupled 
version of bigravity. The structure of the vacuum theory might hint that any fundamental 
theory which gives rise to massive bigravity does not discriminate between the two metrics. 
Consequently it is important to explore doubly-coupled bigravity, in which matter couples to 
both metrics in a way that maintains the interchange duality (5.1). 

We are specifically interested in the possibility of double-coupling schemes in which there 
is no “effective metric” whose geodesics describe the motion of matter. In these cases, we 
must introduce new tools for understanding the physical geometry of spacetime. We will 
focus on one of the most straightforward possibilities for double coupling, in which matter is 
minimally coupled to each metric. This theory was introduced and its cosmological solutions 
were studied in Ref. [77]. Subsequent work showed this type of coupling revives the Boulware- 
Deser ghost at arbitrarily low energy scales and therefore cannot be a fundamental theory of 
bigravity [72, 73] . Indeed, there is only one known double coupling which avoids the ghost 
at low energies, and even in this theory it re-emerges at or above the strong-coupling scale 
[73,163].^ This coupling is the phenomenologically interesting one and will be investigated 
in depth in chapters 6 and 7. These theories couple matter to a single effective metric built 
out of g^y and f^y and so avoid the problem of determining the physical spacetime. It is, 
however, not yet clear whether such theories are immune from other types of pathologies, and 
it may well be the case that the unknown, healthy doubly-coupled theory of bigravity will not 
admit an effective-metric formulation at all. It is our goal in this chapter to demonstrate the 
difficulties such theories would have with regards to defining observables by studying arguably 
the simplest example of doubly-coupled bigravity without an effective-metric description. In 
this context, the traditional notion of a “physical metric” may have to be discarded, leaving 
us faced with entirely new conceptual challenges in interpreting even the observables of the 
theory. 

This chapter is organised as follows. In section 5.1 we argue that there is no effective 
metric to which matter minimally couples, and that such a metric does not even exist for most 

^Using the same principles, further candidate double couplings have been constructed in Ref. [164], but it is 
not yet known which, if any, of these are ghost-free. 
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individual fields. In section 5.2 we explore light propagation in this theory in the geometric 
optics limit, and discuss the problem of relating cosmological ohservahles to the underlying 
theory when we can no longer descrihe photon trajectories as null geodesics in a metric. In 
section 5.3 we examine the dynamics of point particles, finding that they effectively live in a 
Finsler spacetime, a geometry which depends nontrivially on the coordinate intervals. Finally, 
we conclude in section 5.4. 


5.1 The Lack of a Physical Metric 

We consider a douhly-coupled himetric theory in which the action (2.30) is extended hy the 
addition of a minimal-coupling term between matter fields, and 


M2 

g 


JRR ■ 


r r _^ 


(5.2) 


This extends the symmetry (5.1) to the entire action, as long as we also exchange ffg and a/. 


Sfiv^ Pn ^ Pa-ih ^ 


(5.3) 


The presence of the interaction term is crucial; if one were to couple two pure, noninteracting 
GR sectors to the same matter, the Bianchi identities would constrain that matter to be entirely 
nondynamical [77]. Note that Ug and a/ are not both necessary to fully specify the theory; 
only their ratio is physical, as can be seen by rescaling the action by a“h For the purposes of 
this chapter, we will find it useful to leave both in so as to keep the symmetry between the two 
metrics explicit. 

An immediate concern is the violation of the equivalence principle. However, because the 
Vainshtein mechanism screens massive-gravity effects [54], it is not obvious how stringent 
the constraints from tests of GR in the solar system would be: the modifications might be 
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hidden from local experiments while showing up at cosmological scales. The cosmology 


of this douhly-coupled theory has been studied and shown to produce viahle late-time 


accelerating background expeinsion without an explicit cosmological constant term [77], and 
with a phenomenology which can be interestingly different from that of the singly-coupled 
theory [80]. We emphasise again that this model itself possesses the Boulware-Deser ghost 
and hence we cannot trust its cosmological solutions, but a ghost-free doubly-coupled theory 
may well have similar properties. Indeed, the cosmological phenomenology of this theory is 
quite similar to that of the healthier doubly-coupled theory introduced in Ref. [73], as we will 
show in chapter 6. 

We can readily confirm that no physical Riemannian metric exists in the sense that all 
matter species would minimally couple to it and thus follow its geodesics. Indeed, for some 
matter fields such a metric does not exist at all. Consider a massive scalar field. Its action is 
given by 



Let us assume that (p is minimally coupled to an effective metric, [g/iv> fjiv] - This metric is 
defined through the relation 



(5.5) 


where we have defined the various scalar Lagrangians as 


hT'’di,(j)dy(j) - Vicp). 


(5.6) 


The kinetic and potential terms, respectively, yield the conditions 



(5.7) 



(5.8) 
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These are not necessarily consistent with each other: taking the determinant of equation (5.7), 
we find 

+ (5.9) 

so that equations (5.7) and (5.8) overdetermine unless g^y and f^y satisfy the nontrivial 
relation 

+ =-det(ag/^g'^'' + a/-/^/'''’). (5.10) 

However, even the simplest case, g^v = f/iv — fails this test, as well as more complicated 
but physically-relevant cases like FLRW. 

Therefore, any choice of will generally result in either a noncanonical kinetic term or a 
spacetime-varying mass for the scalar. For some special choices of g^v and f^y, particularly if 
they are related by a constant conformal factor, then this will only rescale the mass or kinetic 
term by a constant amount. In this particular theory, however, such a relation between g^y 
and f^y is far from general [77]. 

If the scalar field is massless, Vicp] = 0, then we lose the constraint (5.8). Massless scalars 
therefore do have physical metrics defined by equation (5.7). As a consistency check, we can 
confirm that the Klein-Gordon equation in 


□zi^ =0, 


(5.11) 


yields the correct massless Klein-Gordon equation [77] 

+ (5.12) 

This is straightforward to show using the identity g^^'^T^y — --^dn[^—gg>^P). This is the 
only example we have found of a field for which we can construct an effective metric. 


Consider the electromagnetic field A^. This is of paramount importance for cosmology. 
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since we make observations by tracking photons. Its action is 

F^yFafi, (5.13) 

where F^y — d^Ay — dyA^, is the usual field-strength tensor and does not depend on a metric. If 
A^ is minimally coupled to an effective metric, h^y, then we can write equation (5.13) as 

SA^-^^d^x^h>^'^h^PF^yFap. (5.14) 

This implies that h^y obeys 


+ (5.15) 

However, this equation overconstrains Consider the 00-00, 00-/1, and ii-ii components, 

«g/^ + “/-/^ (/“)^ = {h°°y, (5.16) 

«gV^g“g"(5.17) 


where repeated indices are not summed over. Solving for and h" using equations (5.16) 
and (5.17), equation (5.18) becomes a constraint on g^y and f^y, 

g00fii^f00gii_ 


Note that we have chosen an arbitrary spatial index, /, in an arbitrary coordinate system; 
equation (5.19) therefore applies to any diagonal spatial component in any coordinates. This 
equation is not satisfied by general choices of g^y and f^y. An FLRW universe is a simple 
example where this condition fails to be satisfied. Thus, except in special circumstances, there 
is no physical metric for the electromagnetic field. 
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Similar arguments should hold for other fields. The massless scalar appears to he a special 
case because it lacks a potential term to constrain h^v and because it has no indices, so its 
kinetic term only includes one appearance of the metric. 


5.2 Light Propagation and the Prohlem of Ohservahles 

We have shown that the electromagnetic field is not minimally coupled to any effective 
metric. This case is of particular physical relevance because we make observations by tracking 
photons. For cosmological observations, especially, it is crucial to know how light propagates. 

Even in this simplified case, photons turn out not to travel on null geodesics of any metric. 
To see this, we will consider the plane-wave approximation for the Maxwell field, 

A^ = Re[(a^ + ehJe‘'^/^], (5.20) 

and take the geometric optics limit in which the wavelength is tiny compared to the charac¬ 
teristic curvature scale, e = A/i? < 1. This provides a rigorous approach to describing light 
propagation in curved spacetime. In this ansatz, is the leading-order polarisation vector 
and ip is the phase. Herein we will drop the real evaluation for compactness. Because this is a 
“pregeometric” approach, we can utilise it to tackle the propagation of light rays in bigravity. 
The stress-energy tensor for the electromagnetic field, which can be derived from the action 
(5.13), is 

T% = ^ j . (5.21) 

We must be careful about which quantities depend on a metric and which don’t. The field 
tensor is defined as usual in terms of the electromagnetic 4-potential A^, which is itself defined 
with lower indices just in terms of the fields, so is the same in both metrics. Similarly, 
because of the symmetries of the Christoffel symbols, F^,, can be defined equivalently in terms 
of covariant or partial derivatives; because of the latter, we see that F^v with lower indices is 
also independent of the metric. 
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Stress-energy conservation is given by [77] 

T^r + af r;,, = 0, (5.22) 

where and are the covariant derivatives defined with respect to gjj,^ and respec¬ 
tively. To apply this to the electromagnetic field, we first need to know (in terms of g^v, 
for concreteness) the divergence of the stress-energy tensor. The identity V[aF^v] = 0 holds 
independently of the theory of gravity and in either metric, because it relies only on the usual 
expression for the commutation of covariant derivatives and the symmetries of the Riemann 
tensor. Using this, we find 

= (5.23) 

Plugging this into equation (5.22) and using the fact that it should apply for arbitrary F^a 
(because, as mentioned above, this is independent of the metrics), we find a straightforward 
generalisation of the Maxwell equations, 

where g and / subscripts on tell us which metric is being used to raise indices. 

We have yet to use our gauge freedom. We will choose to work in a Lorenz gauge, where 
V = 0. Since this cannot be simultaneously satisfied in both metrics, we will choose to 
apply this gauge with respect to g^y. As we will see shortly, this choice does not make a 
difference at leading order in the geometric optics approximation. After specialising to this 
gauge and commuting some covariant derivatives, the Maxwell equation reduces to 

Plugging the ansatz (5.20) into equation (5.25) and keeping only the leading-order terms in 
e—i.e., those obtained by acting the covariant derivatives on the exponential term twice—we 
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obtain 

a, kakp=o, ( 5 . 26 ) 

where A:^ = d^ip is the wavevector. Note that in the singly-coupled limit, this gives us the 
standard result that ka is null in 

As discussed above, we cannot use this to define a metric, in which ka is lightlike. This 

creates problems when applying the standard methods of relativistic cosmology to compare 
observable quantities to the underlying theory. In a bimetric cosmology, as we have seen, 
there are two scale factors and two Hubble rates. When matter couples to both metrics, 
neither of these quantities plays the role that they play in general relativity. Had we been 
able to identify an effective metric from equation (5.26), then the scale factor of that metric 
would have been the geometrical quantity that entered the expression for the redshift, and its 
Hubble rate (computed using the effective metric’s lapse) would be the “physical” Hubble 
rate. The next step, relating the theoretical redshift to the shift in wavelengths observed by 
a telescope, would involve understanding the proper time of a massive observer, which we 
tackle in the next section. However, equation (5.26) defies the usual, simple categorisation. 
While we can, in principle, use this to compute light propagation, this approach does not 
shed light on the identification of a physical scale factor to compare to observations. 


5.3 Point Particles and Non-Riemannian Geometry 

The situation we have described in bimetric theories is radically different from the extensively- 
studied nonminimally coupled theories where the behaviour of matter can be described in 
terms of a single metric. In the context of scalcir-tensor theories, for example, it is well known 
that there are conformally-equivalent descriptions of the theory where either the gravity 
sector is general relativity whilst matter has a nonminimal coupling (the Einstein frame), 
or matter is minimally coupled whilst the gravity sector is modified (the Jordan frame). All 
physical predictions are completely independent of the frame in which they are calculated 
after properly taking into account the rescaling of units in the Einstein frame [165]. One can 
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generalise to nonuniversal couplings, allowing different Jordan frame metrics for different 
matter species, or to couplings to multiple fields. These bring about new technical but not 
fundamental difficulties. However, the doubly-coupled bimetric theories we are studying do 
not admit a Jordan frame at all for most types of matter. They possess mathematically two 
metrics but physically none, and to understand them we need to step beyond the confines of 
metric geometry. 

For concreteness, let us look at the simplest possible type of matter: a point particle of 
mass m. Its action is given by 



(5.27) 


where overdots denote derivatives with respect to a parameter A along the particle’s trajectory. 
Varying with respect to A, we obtain the “geodesic” equation [77] 



(5.28) 


where Ug = dxi^/dsg is the four-velocity properly normalised with respect to g^v, such that 
gg^yUgU'^ — 1, and is defined analogously for the geometry. In defining Ug and 
we have introduced the line elements for the two metrics, ds^ = g^ydxf^dx'' and dsj = 
fnydx^^dx^’. 

Is equation (5.28) the geodesic equation for a Riemannian metric? In other words, can 
the motion of point particles in this bimetric theory be described as geodesic motion of an 
effective metric? We can gain insight on this question by writing equation (5.27) in the form 



(5.29) 


To see that this is equivalent to equation (5.27), note that we can write 


— idsg^ gg^vUgU''^ — idsg, 


( 5 . 30 ) 
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where we have used the fact that (by definition) UgU''^ = l. Similar logic holds for The 
form (5.29) is less useful calculationally, particularly for deriving the geodesic equation (5.28), 
but it opens up a helpful rephrasing of the question of an effective metric: we want to find a 
line element, ds , for which Spp — —im^ ds. This would imply 


ds^agdsg + afdsf. (5.31) 

Squaring this and plugging back in the definitions of d Sg and dsf,we find that the cross-term 
introduces a non-Riemannian piece, 

ds^— {j^^gg/iv + cc^ffixv^ dx^dx'’ +2agaf\Jgfj^vfapdx^^dx'^dx'^dxP, (5.32) 

and so point particles do not move on geodesics of an effective metric. 


In fact, equation (5.32) is the line element of a Finsler geometry [166, 167]. A Finsler 
spacetime can be defined by the most general line element that is homogeneous of degree 2 
in the coordinate intervals dxi^, i.e.. 


ds^ — f{x^ ,dx^'), (5.33) 

f{x^,Xdx'’) — X^f{x^,dx'^). (5.34) 

The homogeneity property (5.34) conveniently allows us to write the Finsler line element in a 
pseudometric form. Following Bekenstein [167], let us write equation (5.34) taking A = 1 + e 
and e < 1, 


^ ddx!^ 2 ddx^^ddx^ 


dx^dx'’+ 6{e^)—{\+2e + e^^ f, (5.35) 


where / here is shorthand for f[x^, dx^). Taking the piece, we find the line element can 
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be written in terms of a quasimetric 

f — ds^ — ^^nvdx^dx'', (5.36) 


where the quasimetric is defined by 


^ ^ 1 d^f 

Iddx^ddx^' 


(5.37) 


It is worth noting that this is an exact relation, as we have not thrown away any information 
by expanding in e. We have simply used the fact that the condition (5.34) has to be satisfied 
at every single order in e. Indeed, this result equivalently follows from Euler’s theorem for 
homogeneous functions. Euler’s theorem for a multivariate function (in this case, a function 
of each component of dx"^', holding fixed) can be written^ 




1 df 

2 ddx' 


dx^ 


(5.38) 


Differentiating this with respect to dx>^, we obtain 


ddx>^ 


ALs' + 

ddx'' ^ ddx^ddx^ 


dx' 


df 

ddx^ 


d dx^^ddx'' 


dx^'. 


(5.39) 


Plugging this back into equation (5.38), we recover the result (5.36, 5.37). 

Note that the quasimetric can depend on the coordinate intervals, dx>^, which is how it 
differs from the metric of a usual Riemannian spacetime. We can see this by applying the 
equation (5.37) to the definition (5.32) of / to explicitly calculate %v. 




flV 


■Ctlgap+Ct^ffap+CCgaf 


dSg 




ufu-^ 


f j +2uf„u^ 




. (5.40) 


In addition to constant conformal relations to the original metrics, this quasimetric is disfor- 


^Note that this is the &{e) piece of equation (5.35). This form of the line element, / = p^idx^, defines the 
Finsler one-form, [168]. 
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malty related to the particle’s four-velocity. The link between Finsler geometries and disformal 
relations is not new; certain Finsler geometries can be described as Riemannian spacetime 
with matter disformaly coupled to a scalar or vector field, such as a disformal scalar-tensor 
theory where matter couples to the effective metric g^v + [167], Disformal theories of 

gravity have attracted attention recently [169,170] ; it would be interesting if these theories 
turned out to be related to bigravity. 

This formulation in terms of Finsler geometry opens up our understanding of point- 
particle dynamics. For massive particles, we can define the proper time, t, from the line 
element in the usual way, 

= -ds^. (5.41) 


It follows trivially that, in terms of this proper time, massive point particles travel on unit-norm 
timelike geodesics with respect to the quasimetric. 




dx‘^ dx^ 
dz dz 


- 1 . 


(5.42) 


We are also now in a position to extend the action (5.27) to massless particles. This action 
vanishes in the limit m ^ 0 and so is technically only defined for massive particles. In general 
relativity, the geodesic equation does hold for massless particles. This can be seen from the 
fact that m drops out of the geodesic equation, but to show it rigorously, it is common to 
introduce a Lagrange multiplier, often called the einbein. The same logic carries over to our 
bimetric theory uninterrupted. Let us write the action (5.27) in terms of a parameter A and 
introduce the einbein, e(A), as 


S 


1 

2 


dX 


e-\X) 



2 

-I- m^e(A) 


(5.43) 


For m / 0, varying this with respect to e we find 


1 ds 
m dX' 


(5.44) 
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Plugging this into the action (5.43), we obtain the original action, equation (5.27). But we can 
now extend the treatment to m = 0. In this case, varying with respect to e yields 


ds^ = 0. 


(5.45) 


Then, varying with respect to x“, we find the same geodesic equation as for the massive point 
particles. In other words, we have found that massless point particles travel on null geodesics 
of %v We may want to use a different form than (5.28) for the geodesic equation when 
dealing with massless point particles, since in general dsg and dsf may vanish for a massless 
particle.^ We can write the geodesic equation (for a massive or massless point particle) in 
terms of the quasimetric as 



(5.46) 


Note that we do not write this in terms of Christoffel symbols because we do not have to; 
if we had, we would need to calculate the inverse quasimetric, which is both difficult and 
unnecessary. 

This result is straightforward to extend to theories with N interacting metrics corre¬ 
sponding to a massless graviton with a tower of iV - 1 massive gravitons [70, 171]. In this case, 
the line element is defined by 



(5.47) 


which is clearly Finslerian. 


^This will be the case in particular if g^v and /^v are conformally related, as then a lightlike path in one metric 
is also lightlike in the other. 
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5.4 Summary of Results 

We have examined an example of a bimetric theory in which, due to their minimal coupling to 
both metrics, matter fields do not feel a universal physical metric. We have found that when 
coupling matter to multiple metrics, the massless scalar might be unique in minimally cou¬ 
pling to a Riemannian effective metric. The massive scalar. Maxwell field, and point particle 
all provide counter-examples. We examined in detail light propagation in the geometric optics 
limit and showed that there is a distinct problem in relating observations to the underlying 
theory. We can make progress by generalising the line element beyond a Riemannian form. 
In particular, we showed that point particles follow geodesics of a Finsler spacetime, which is 
nonmetric. This geometry that emerges for a pointlike observer depends quite nontrivially 
upon, in addition to the two metric structures, the observer’s own four-velocity through a 
disformal coupling. 

These considerations in this chapter may force us to rethink the geometric nature of 
spacetime, even in a metric theory of gravity. Consider the fundamental question of how to 
relate bigravity to observations, such as cosmological measurements. The textbook methods 
lean heavily on the existence of a “Jordan-frame” metric to which matter is minimally coupled. 
Here, however, such a metric does not exist universally, and may not exist at all for certain 
species of matter. How, then, should one calculate the redshift and the luminosity distance 
of a cosmological source in terms of the underlying FLRW geometries? Even the proper 
time along a timelike path is no longer trivial, as we cannot use the assumption dz — —ds. 
Indeed, because of the different effective metrics (or lack thereof), the notion of proper time 
is likely no longer even unique, depending instead on which matter fields an observer uses to 
construct her clock. 

Perhaps the best approach to solving physical problems in bimetric spacetimes without 
an effective metric is to go back to “primitive,” pre-geometric constructions. In the absence of 
a single spacetime on which to formulate physics, we may need to simply consider particle 
motion coupled to two (or more) spin-2 fields in a way that only looks geometric because it is 
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the nature of the spin-2 particle to invoke geometry [12,13]. 

Paradoxically, once we have doubled geometry, we lose the ability to use its familiar 
methods. This is a call to go back to the basics, and rediscover the justification for results 
which we have taken for granted for the better part of the last century. 



The universe is full of magical things patiently waiting for our wits to grow sharper. 


Eden Phillpotts, A Shadow Passes 

Cosmological Implications of 
Doubly-Coupled Massive Bigravity 



So far we have studied the cosmological solutions of massive bigravity in chapters 3 and 4 
with matter coupled only to one metric, and discussed some of the theoretical issues with 
extending to a bimetric matter coupling in chapter 5. As emphasised in the introduction 
of chapter 5, the singly-coupled theory spoils the metric interchange symmetry present in 
vacuum; the kinetic and mass terms treat the metrics on equal footing, hut this is broken when 
one couples matter to only one metric. It is therefore compelling to investigate other types of 
matter coupling that extend this metric-interchange symmetry to the entire theory. Moreover, 
as demonstrated in chapter 3, cosmological background viability and linear stability rule out 
all but a small handful of the parameter space of the singly-coupled theory. By extending the 
matter coupling, we may be able to open up the space of observationally-allowed bimetric 
theories. 

The most significant obstacle to the construction of such a theory is that almost all 
attempts to couple matter to both metrics, such as the double minimal coupling discussed in 
chapter 5, reintroduce the Boulware-Deser ghost (cf. section 2.1.1) at arbitrarily low energies 
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[71-73]. One of the papers demonstrating this, Ref. [73], also proposed a double coupling 
which is significantly better-behaved with respect to the Boulware-Deser ghost. While a ghost 
does appear in this theory, it appears at a scale at least as high as the strong coupling scale 
and possibly parametrically larger, in which case it is outside the domain of the validity of 
the effective theory. While this may present a problem for highly anisotropic solutions, the 
absence of the ghost around FLRW solutions was demonstrated explicitly [73]. The status of 
the ghost in this specific coupling has also been investigated in Refs. [161, 162,172]. 

In this theory, matter couples minimally to an effective metric constructed out of the two 
metrics appearing in the gravitational sector of the theory, regardless of whether the second 
metric is dynamical. This would alleviate the problem of constructing physical observables 
discussed in chapter 5, as matter would move on geodesics of the effective metric. This 
proposal has been derived using complementary methods and extended to a multi-metric 
framework in Ref. [ 163] , while the cosmology of this new coupling has been investigated in 
the dRGT context in Ref. [173] and will be discussed in chapter 7. 

In this chapter, we study the background cosmology of massive bigravity when matter 
couples to the effective metric proposed in Ref. [73]. We show that the background expansion 
can asymptotically approach ACDM at both early and late times, and for certain parameter 
values is identical to ACDM always. At the background level, this type of coupling is therefore 
consistent with observations. In a future study, we will investigate whether this holds true for 
cosmological perturbations. 

This chapter is organised as follows. In section 6.1 we present the effective metric and the 
symmetries that are present in the action. In section 6.2 we derive the cosmological equations 
of motion and discuss their main features. A parameter scan of the minimal models, where 
only one of the interaction terms is nonvanishing, is performed in section 6.3. In section 6.4 
we discuss some special parameter choices. We conclude in section 6.5. 
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6.1 Doubly-Coupled Bigravity 


In this chapter we will extend the higravity action (2.30) to a douhly-coupled version with an 
effective metric, given hy 


M2 

SHR,dc =-^ 


J d^x/^i?(g)-^ J d^x^/^Rif) 
J n=0 J 


( 6 . 1 ) 


The effective metric, first introduced in Ref. [73], is defined by2 


g^v^O'^gf^v+2aPg^a^"r+P^ff,r, = (VT^fv- (6.2) 


As shown in appendix C, the effective metric is symmetric under the interchange g^v f/iv 
and a<-^ p. This makes the entire action symmetric under the transformations 


gfiv ^ Pn * Pa-uj cl ^ ^P- (6.3) 

There is thus a duality between the two metrics present in the action which is spoiled when 
matter couples to only one of the metrics (taken by setting either a = 0 or yS = 0). 

The effective metric has the convenient property that its determinant is in fact in the form 
of the ghost-free interaction potential in equation (6.1). In particuleir, the determinant can be 
written as [73] 

-/^det(a + ^X). (6.4) 

The right-hand side is a deformed determinant, and it appears naturally when constructing a 
ghost-free potential [45]. Indeed, this deformed determinant is nothing other than a subset of 


^ We will denote the effective metric with “eff” written as a superscript or subscript interchangeably. 

^In Ref. [73] the effective metric is given in an explicitly symmetric form, but this is not needed since 
g^aX“v = gvaX“^, as first shown in Ref. [71]; see also appendix C. 
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the ghost-free dRGT potential with specific choices for p„, 


4 


det[a +px) 


(6.5) 


n=0 


Therefore matter loops, which will generate a term of the form ^-ges^v, will by construction 


not lead to a Boulware-Deser ghost. This simple criterion in fact dooms many other forms of 
double coupling and is, in part, what motivated Refs. [73, 163] to construct this specific form 



The action (6.1) contains two Planck masses (Mg and My), five interaction parameters 
iP„, of which ySo and are the cosmological constants for g and /, respectively), and two 
parameters describing how matter couples to each metric (a and yS). The Planck masses 
and the coupling parameters a and p only enter observable quantities through their ratios. 
Moreover, one of those ratios is redundant: as described in appendix C, the action can be 
freely rescaled so that either My/Mg or p/a is set to unity.^ Therefore the physically-relevant 
parameters are and either My/Mg or p/a. In this chapter we will rescale the Planck masses 
so that there is one effective gravitational coupling strength, Meff. We will also keep a and p 
explicit to make the a<^ p symmetry manifest, but the reader should bear in mind that only 
their ratio matters physically. All observational constraints will be given solely in terms of 
P/a, from which it is straightforward to take the singly-coupled limit, P/a^Q. 

The Einstein equations have been derived in Ref. [174] and can be written in the form 



n=0 g 



n=0 / * ^ 


The matrices Y and Y depend on \/ g and respectively, and are the same as were 


^See also section 2.1.2 for the redundancy of the Planck masses in the singly-coupled theory. 
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defined in equation (2.41). The Einstein tensors Gg'^ and have their indices raised with 
g^y and ff^y, respectively. Note that the terms with geff can be simplified using equation (6.4). 
The stress-energy tensor is defined with respect to the effective metric geff as 


v/-geffi^'m (geff,^)] = -gelfT^''5 


( 6 . 8 ) 


and obeys the usual conservation equation 


yeff J/iV ^ ^g_g^ 

where is the covariant derivative for g®®. 


6.2 Cosmological Equations and Their Solutions 


To describe homogeneous and isotropic cosmologies, we specialise to the Friedmann-Lemaitre- 
Robertson-Walker (FLRW) ansatze for both g^y and f^y, 

ds^^ — -N^^dt^ + a^^dx^, ( 6 . 10 ) 

ds^j: — -N^dt^ + a‘^j:dx^, ( 6 . 11 ) 

where Ngj and Ugj aie the lapses and scale factors, respectively, of the two metrics. Because 
both metrics are on equal footing, we have chemged the notation slightly from chapters 2 to 4 
to be more symmetric between the two metrics. As in general relativity, we can freely rescale 
the time coordinate to fix either Ng or Nf however, their ratio is gauge-invariant and will 
remain unchanged. The effective metric becomes 


dslff — -N^d + a^dx^, 


( 6 . 12 ) 
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where the effective lapse and scale factor are related to those of g^y and f^y by 


N = aNg+pNf, (6.13) 

a — aUg+Paf. (6.14) 

The equations of motion can be derived either directly from equations (6.6) and (6.7), or by 
plugging the FLRW ansatze into the action and varying with respect to the scale factors and 
lapses, as was done in Ref. [73]. We have checked that both approaches yield the same result. 
Defining 


^o(y) = ^0+ 3^iy+3^2y^+^3y^> (6.15) 

^i(y) = ^iy ^+ 3 ^ 2 y ^+ 3 ^ 3 y ^+^ 4 > (6.16) 


where, as before. 


y 


_ Uf 


Uc 


(6.17) 


the Friedmann equations for g^y and f^y are 


^ ap „ 

— 3+^ Bo, 

, Bp „ 

Bi. 


(6.18) 

(6.19) 


Here the energy density, p, is a function of the effective scale factor, a, and we have defined 
the g- and /-metric Hubble rates as 


Hg = 


NgUg 


Hf = 


NfUf 


( 6 . 20 ) 


Notice that the two Friedmann equations for Hg and Hf map into one another under the 
interchange —> / 4 -„, a <—> /, and g^y /^v (which sends Hg <—> Hf, y —> y“/ and 
Bo <—> Bi), as expected from the properties of the action described in appendix C. 
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The stress-energy tensor is conserved with respect to the effective metric, so we immedi¬ 
ately have 



( 6 . 21 ) 


where the density, p, and pressure, p, are defined in the usual way from the stress-energy 
tensor. By taking the divergence of either Einstein equation with respect to the associated 
metric (e.g., taking the g-metric divergence of equation (6.6)) and using the Bianchi identity 
and stress-energy conservation, we obtain the “Bianchi constraint,” 



( 6 . 22 ) 


In complete analogy with the singly-coupled case discussed in section 2.1.3, which can he 
obtained by setting a or yS to zero, equation (6.22) gives rise to two possible branches of 
solutions, one algebraic and one dynamical [79, 81,175].^ 

6.2.1 Algebraic Branch of the Bianchi Constraint 

As discussed in section 2.1.3, in the singly-coupled case, setting the first bracket of equa¬ 
tion (6.22) to zero gives an algebraic constraint on y that can be shown to give solutions that 
are indistinguishable from general relativity at all scales [79]. In the doubly-coupled theory, 
the presence of the pressure term makes the phenomenology of the algebraic branch richer. 

In this section, without any ambition to examine all possible solutions, we briefly outline 
some of the properties of a few specific solutions on the algebraic branch of the Bianchi 
constraint (6.22). In this branch we have 



(6.23) 


^In the singly-coupled theory, equation (6.22) would he a constraint equation arising from the Bianchi identity 
and stress-energy conservation. When using the effective coupling, the stress-energy conservation holds with 
respect to the effective metric, rather than g^v or /^v This gives rise to the pressure-dependent term in the 
left bracket. Due to this term, both branches—obtained by setting either bracket to zero—can be regarded as 
dynamical. We choose to adopt the terminology from the singly-coupled case here, however. 
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If the Universe is dominated by dust ip — 0), then as in the singly-coupled theory this is a 
polynomial equation for y, 

I3i+2l32y + — 0, (6.24) 

which is solved by a constant y —yc- Notice that when y is constant, the mass terms in the 
two Friedmann equations become constant, so Hg and Hf are determined by Friedmann 
equations containing effective cosmological constants.^ Using the fact that a — (a + ySy^) Ug — 
{a/yc +P)af,we can show that the observed Hubble rate, H — d/iaN), for a constant y is 
given by 

H = H,{a + fS^\ ■ (6.25) 

If the ratio Ng/Nf is constant, the solutions on this branch contain an exact cosmological 
constant (at least at the background level) given by a combination of the metric interaction 
terms. 

Since for a constant y, the two Hubble rates are related by 


Hf^H, 


’Nf’ 


(6.26) 


the bimetric interactions mimic a cosmological constant when Hg/Hf — const. This is only 
possible if the parameters satisfy ay^^Bi = p Bq. For more general parameter values, we have 


Nf 


3aH2y3 


Ng J 3pH^^ + m^[aBiy^-pBoy 


(6.27) 


which is dynamical, so these cosmologies are not exactly ACDM. 

For nonzero pressure, p / 0, we can rewrite the constraint (6.23) as 


p2.. , + + 


i+2j^y + j^y"l = 


Miff 


(6.28) 


We will not attempt to classify the solutions in this more complicated scenario. However, we 


^These are not, however, ACDM cosmologies for the effective metric due to the nontrivial coupling to p. 






6.2. Cosmological Equations and Their Solutions 


145 


note that for the special parameter choice 


B 

a 


(6.29) 


we obtain 


p m'^Pi 


(6.30) 



i.e., we are required to have a constant p, corresponding to a vacuum equation of state, 
ic = —1. 

6.2.2 Dynamical Branch of the Bianchi Constraint 

As is most often done in singly-coupled bigravity models—see, for example. Refs. [75, 79, 80, 
85, 141] and chapters 2 to 4 —in the remainder of this chapter we will restrict our study to 
solutions where the second bracket in equation (6.22) vanishes, as these will turn out to be 
consistent with observational data. In this branch we have a dynamical constraint on the 
ratio between Nf and Ng, 


Nf df duf 


(6.31) 



This implies the simple relation Hfy = Hg. Furthermore, the physical Hubble rate H, defined 
as 


a 


H= -, 

Na 


(6.32) 


becomes 


Hs yHf 


(6.33) 


a+Py a+Py 


Combining the two Friedmann equations, we obtain the equations for H and y, 



(6.34) 



(6.35) 
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Equations (6.34) and (6.35) determine the expansion history completely and are invariant 
under the comhination of —> yS 4 _„, a andy ^y“h They have the same structure as 

in singly-coupled bigravity (cf. section 2.1.3): there is a single Friedmann equation sourced by 
p and y, while y evolves according to an algebraic equation whose only time dependence 
comes from p. Notice that due to equation (6.35) one can write many different, equivalent 
forms of the Friedmann equation for H^. It is therefore dangerous to directly identify the 
factors in front of p in equation (6.34) as a time-varying gravitational constant and the term 
proportional to as a dynamical dark energy component: both of these effects are present, 
but they cannot be straightforwardly separated from each other. 

From equation (6.35), we see that as p ^ oo in the far past, either y —> yS/a or y ^ 
—a/p. One can show that if p ~ a“P then ~ a~'^Pl^ as y —> — a/yS. Since this scenario is 
observationally excluded, we will not consider this limit. Recall from section 2.1.3 that in the 
singly-coupled theory there are also infinite-branch solutions where y ^ oo at early times 
[85]. Indeed, as we saw in chapter 3, these infinite-branch solutions are crucial in order to 
avoid linear instabilities. However, in the doubly-coupled theory, there are no solutions to 
equation (6.35) in which y ^ oo as p —> oo. This is because of the new term proportional to 
aySy^p; none of the terms in Bq — y^Bi, which grows at most as y^, can possibly cancel off 
this term as p —> oo. 

An interesting feature is that in the early Universe the mass term drops away but we are 
left with a modification to the gravitational constant. 




{a^ + P‘^)p 

3Mgg 


(6.36) 


Since the coefficient in front of p in the Friedmann equation during radiation domination 
can be probed by big bang nucleosynthesis, this could in principle be used to constrain the 
parameters of the theory. However, this will only work if the corresponding factor in front of 
p in local gravity measurements has a different dependence on a and yS. The solar-system 
predictions for this theory have not, to date, been worked out. 
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In the far future, as p ^ 0, we have two possibilities. The first is that y goes to a constant 
yc, determined by 


P'iYc + (3^2 ~ A) Tc ^ (^1 ^ ^3) yl + (^0 “ 3^2) yc~ Pi — 0. (6.37) 


These models approach a de Sitter phase at late times (whether they se//-accelerate is a subtle 
question which we address below), with a cosmological constant given by 


[^1 + (^0+3^2) Tc +3 (^1+^3) y^ + {PP2+Pi) y^+P^y^^ 
2yc[a + PycY 


(6.38) 


The second possibility is that, for some parameter choices, |y | ^ 00 such that the leading- 
order Pn term in equation (6.35) exactly cancels the leading density term, y'^p.lt is unclear 
whether these solutions are viable; in this chapter, we will restrict ourselves to solutions 
where y is asymptotically constant in the past and future, starting dXy — p/a and ending with 
y — yc- This implies that Ug and a / are proportional to one another in both the early and late 
Universe. As long as y does not exhibit any singular behaviour, the evolution between y — p/a 
and y = yc is monotonic. This can be seen by taking a time derivative of equation (6.35) and 
setting y = 0. 


The monotonicity of the evolution of y implies that in the special case where yc — P /«> 
then we will have y — P/aat all times, and the expansion history is identical to ACDM. This 
is a new feature of the doubly-coupled theory: in the singly-coupled case, yc becomes zero 
in the presence of matter, which makes such a case trivially identical to general relativity. A 
constant y occurs in any model where the Pn parameters and p/a are chosen to satisfy 

^3^—^ +(3^2“A)^~^ T3(^i—^ 3 )^—^ + (^ 0 “ 3 ^ 2 ) = 0, (6.39) 

which is simply equation (6.37) with yc— P /ol. An interesting implication of solutions with 
constanty is that, since equation (6.31) implies Ai//Aig = da//dag = y, the two metrics are 
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proportional, f^y =y2g^^.6 


6.3 Comparison to Data: Minimal Models 


In this section, we compare the background expansion derived above to observations and 
perform a parameter scan of the minimal models, in which only one of the p„ is nonzero. Due 
to the duality property of the solutions, we only have to look at the ySo, P\, and P 2 cases. We 
will restrict ourselves to positive p/a-, in principle negative values could also be allowed, but 
we have not yet investigated the physical implications of these values. ^ The minimal models 
admit exact ACDM solutions when P/a — |o, l| for the Po, Pi, and P 2 cases, respectively, 
as is evident from equation (6.39). 

Since we have so far calculated the equations of motion only for homogeneous back¬ 
grounds, we will limit this study to purely geometrical tests of the background expansion, 
including the redshift-luminosity relation of Type la supernovae (SNe) [148], the observed an¬ 
gular scales of cosmic microwave background (CMB) anisotropies [36], and baryon-acoustic 
oscillations (BAO) [150, 152, 176]. Since the latter two depend on the physical size of the 
sound horizon scale around the time when the CMB photons decoupled from the baryon 
plasma, we can cancel out this dependence by using only the ratio of the observed angular 
scales in the CMB and BAO [79, 177]. In this way, we obtain a cosmological probe that is 
highly insensitive to the physics of the early Universe, and almost exclusively sensitive to the 
expansion history of the Universe between z ~ 1000 and today. 

We can calculate the effective equation of state for the background model described in 
eqs. (6.34) and (6.35) using 


Weff=-1 


1 dlogH^ 


(6.40) 


3 dloga 

Since in this chapter we restrict ourselves to solutions where y approaches constant values 


®It is not difficult to see that there are no cases in which the two metrics are related hy a dynamical conformal 
factor; from equation (6.31) any conformal relation means that daf/dag — Uf/ag, but this implies af/ug — 
const. 

^Note that )3 < 0 leads to instabilities in the case of doubly-coupled dRGT massive gravity, in which one of the 
metrics is nondynamical [173]. 
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in the infinite past and future, for matter-dominated models we are guaranteed to have an 
effective equation of state where Wes ^ 0 as a ^ 0 (ignoring radiation) and Weff —> -1 as 
a —> 00 , mimicking the asymptotic behaviour of the ACDM model. Except for some special 
parameter choices which are exactly ACDM (see the discussion above, as well as section 6.4), 
we expect the model to deviate from the concordance model at all finite times. 

It is well-known that ACDM is able to provide an excellent fit to background expansion 
data, so we expect the success of the bimetric model to depend on how close the effective 
equation of state is to that of ACDM. All solutions that look exactly like ACDM will trivially 
be able to fit existing background expansion data. Note, however, that this does not mean 
that these models are equivalent to ACDM, since they may give different predictions for 
perturbations, i.e., when studying structure formation. 


In figure 6.1, we study the ySo model, i.e., when only ySo is turned on. Notice, cf. equa¬ 
tion (6.34), that this model has no nontrivial interactions between the two metrics, so it 
deviates from ACDM only through the novel matter coupling. In the left panel of figure 6.1, 
we compare the effective equation of state for different values of p/a with that of ACDM. We 


fix Dm = 0.3, where 


Dm - 


a^Po 

3M^ 


(6.41) 


and the subscript 0 indicates a value today. In the right panel of figure 6.1, we plot background 
constraints on D^ and P/a. Note that the value of Po is set by the requirement that we have a 
flat geometry. Shaded contours show constraints from SNe and CMB/BAO data, respectively, 
corresponding to a 95% confidence level for two parameters. Combined constraints are 
shown with solid lines corresponding to 95% and 99.9% confidence levels for two parameters. 
As expected, when p/a^O, the effective equation of state coincides with ACDM since this 
limit corresponds to the singly-coupled case where Po acts as a cosmological constant. Note 
also that as p/a is increased, so is the factor multiplying the matter density in the Friedmann 
equation, and therefore the preferred matter density, D^, becomes smaller. 


In figure 6.2 we plot background constraints on the P\ and Pz models. Since we know that 
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Figure 6.1: Left panel: The effective equation of state, Wen, for the ySo model with 0 < yS/a < 1 
(dotted lines) compared to Wen of the ACDM model (solid line). When yS/a ^ 0, the effective 
equation of state for the ySo model approaches that of the ACDM model. In all cases, Dm = 0.3. 
Right panel: Confidence contours for and yS/a for the ySo model as fitted to SNe, CMB, 
and BAO data. 


the values yS /a = ^ and yS/a = 1 give exact ACDM solutions for the ySi - and yS 2 -only models, 
respectively, we expect these values to provide good fits to the data. This is indeed the case, 
as can be seen in the plots. The P 2 model is especially interesting in this regard, as yS/a = 1 
corresponds to the case where the two metrics and f^y give equal contributions to the 
effective metric (or Mg —Mf when using the equal coupling strength framework described in 
appendix C). Notice that the P 2 model favours yS > 0, as we would expect since the yS 2 -only 
singly-coupled model is not in agreement with background data [80] and is ruled out by 
theoretical viability conditions [85]. 

One of the attractive features of the double coupling is that it allows sensible cosmological 
solutions with only one of the y3„ turned on. For more general combinations of the y3„ 
parameters, we expect the data to favour values that cluster around the value of yS / a given by 
solving equation (6.39), since this value yields an exact ACDM background expansion. We do 
not find it meaningful to do such a parameter scan at this moment, since it is only by including 
other probes, such as spherically symmetric solutions and cosmological perturbations, that 
we can exclude a larger part of the parameter space. However, in the next section, we discuss 
a few special cases that may turn out to be of particular interest for further investigations. 
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Figure 6.2: Confidence contours for and yS/a for the ySi and Pz minimal models as fitted to 
SNe, CMB, and BAG data. In each case, we are able to obtain as good a fit as the concordance 
ACDM model. 


6.4 Special Parameter Cases 


6.4.1 Partially-Massless Gravity 

Partial masslessness arises when a new gauge symmetry is present that eliminates the helicity- 
0 mode of the massive graviton,® removing two of the problems with massive gravity discussed 
in section 2.1: the vDVZ discontinuity in the m ^ 0 limit of linearised massive gravity [52, 53] 
and the need for Vainshtein screening to reconcile the theory with solar system tests [54] . 
This is because both of these aspects of massive gravity are direct results of the fifth force 
mediated by the helicity-0 mode. Moreover, this new gauge symmetry would both determine 
the cosmological constant in terms of the graviton mass and protect a small cosmological 
constant against quantum corrections. Thus it is potentially a solution to both the old and 
new cosmological-constant problems: why the cosmological constant is not huge, and why it 
is not exactly zero, respectively. 

Massive gravity and bigravity contain a candidate partially-massless theory [178, 179], 

®So that a partially-massless graviton has four polarisations rather than the five of a massive graviton, hence 
the name. 
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obtained by making the parameter choices 

— — Pi—Ps — O. 


(6.42) 


For more on partially-massless gravity and its connection to massive (bi) gravity, we refer the 
reader to Ref. [180], as well as Refs. [76,179] and references therein. In singly-coupled bigravity, 
the partially-massless parameter choices could only be imposed in vacuum; including matter 
forces y to be zero, which trivially reduces to general relativity. The nontrivial implications of 
the partially-massless scenario have been demonstrated for other doubly-coupled bigravity 
theories (see Ref. [77], though note that the theory discussed therein appears to have a ghost 
[72, 73]). Here we discuss this class in the context of the present doubly-coupled theory. 


For the partially-massless parameter choices, equation (6.35) implies that y = yS/a at all 
times, and the Friedmann equation becomes 


2 a^ + P^ m^Po 
~ 3Ml^ ^^3ia^+P^y 


(6.43) 


The cosmology of the candidate partially-massless theory is therefore equivalent to standard 
ACDM with an effective cosmological constant, m‘^Po/{a^ -F p^), and a rescaled gravitational 
coupling for matter. Consequently, the background expansion is identical to that of general 
relativity, albeit with shifted constants. Notice that this is a qualitatively new feature as 
compared to the singly-coupled theory. 


Doubly-coupled bigravity with the parameters (6.42) is thus a strong candidate partially- 
massless theory of gravity. In the context of single-metric (dRGT) massive gravity, with matter 
coupled only to the dynamical metric, this parameter choice leads to a theory which is not 
partially massless and in fact suffers from an infinitely strongly-coupled helicity-0 mode 
[181]. If doubly-coupled bigravity is shown to possess the partially-massless gauge symmetry 
nonlinearly and around all backgrounds, it should automatically become one of the most 
interesting available theories of gravity beyond general relativity. 
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6.4.2 Vacuum Energy and the Question of Self-Acceleration 

As discussed in chapter 1, one of the primary motivations for modifying general relativity 
is the possibility of having self-accelerating solutions, i.e., cosmologies which accelerate at 
late times even in the absence of a cosmological constant or vacuum energy contribution. In 
general relativity, as well as in singly-coupled bigravity, these two are degenerate: the vacuum 
energy and a cosmological constant may have different origins, but they are mathematically 
indistinguishable. In bigravity with matter coupled to the effective metric, however, this 
question becomes rather subtle, as the vacuum energy from the matter sector produces more 
than just the cosmological constant terms for g^v and which are equivalent to ySo and 

We have shown in section 6.1 that quantum corrections to matter coupled to g®® will 
generate all of the ghost-free bimetric interaction terms. If we take the matter loops to 
generate a cosmological constant term ■\J-gen^v, then we can see from equations (6.4) 
and (6.5) a pure vacuum-energy contribution can be written in the form of the bigravity 
interaction potential with parameters 






(6.44) 


Let us assume that the parameters take this particular form, i.e., the only metric 
interactions arise from matter loops. The quartic equation (6.35) can then be solved only if 
y — P/a{or p — and the Friedmann equation becomes 

P {a^ + P'^)p {a^ + P‘^)Kv 

H2 = -5^-+ -5^- '—L-Ji .. (6.45) 

Ml^ 3 

Equations (6.44) and (6.45) reduce to the known expression for the ACDM solutions with a 
cosmological constant proportional to either ySo or in the singly-coupled limit (where either 
yS ^0 or a^O). 

It is, of course, not surprising that matter loops lead to an accelerating expansion. How¬ 
ever, the appearance of the vacuum energy in all the bigravity interaction terms has novel 
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implications. First, because the vacuum energy contributes to all the interaction terms, the 
mass scale m is not protected against quantum corrections from matter loops [73]. Therefore, 
any values we obtain for these parameters from comparison of the theory to observations 
must be highly fine-tuned.® This is in contrast to singly-coupled bigravity, in which the only 
parameter that receives contributions from quantum loops is ySo (if one couples matter to 
g^v), just as in general relativity where the cosmological constant is unstable in the presence 
of matter fields. In the singly-coupled theory, both the scale m and the structure of the 
interaction potential are stable to quantum corrections [ 86 , 182] , a very useful fact which is 
lost once we couple matter to This is not a problem in the double coupling studied in 
chapter 5, as loops would only induce g- and /-metric cosmological constants, /o and ^ 4 , 
although that theory is not ghost-free. Candidate expressions for gjj® where the matter sector 
would only contribute quantum corrections to /o and ^4 have been studied in Ref. [164], 
although it is not yet known whether any of these are free of the Boulware-Deser ghost at low 
energies. 

The other implication is that self-accelerating solutions are no longer straightforward 
to define in this theory. Typically, self-acceleration refers to cosmologies which accelerate 
at late times even when the vacuum energy is set to zero. Since in general relativity and 
singly-coupled bigravity, there is a single parameter which is degenerate with the vacuum 
energy (A in the former and /o or ^4 in the latter), one can simply set its value to zero and 
look for other accelerating solutions. In the present doubly-coupled theory, however, all 
interaction terms are degenerate with the vacuum energy: given an interaction potential, 
there is no way to unambiguously determine the value of A„. In that respect, we cannot set 
some of the parameters to zero in order to restrict ourselves to accelerating solutions arising 
from nonvacuum, massive-gravity interaction terms (unless we set all the parameters to zero, 
which will give uninteresting solutions). Therefore, from a particle physics point of view this 
theory lacks, or at the very least cannot unambiguously define, self-accelerating solutions. 

®If the case described in section 6.4.1 is truly partially massless, this may be an exception, as there is a new 
gauge symmetry to protect against quantum corrections. 

^“Indeed, the fact that a small graviton mass is stable against quantum corrections is one of the main motiva¬ 
tions for studying massive (hi) gravity, particularly as a candidate to explain the accelerating Universe. 




6.5. Summary of Results 


155 


6.4.3 Maximally-Symmetric Bigravity 

The parameter choice 

= Pi =133, a = p, (6.46) 


is special in the sense that the duality transformation (6.3) maps solutions to themselves.^^ 
Thus this theory is maximally symmetric between the two metrics: they appear in the theory 
in completely equal ways. In this case, the quartic equation (6.35) becomes 




(y ^ +1) + 3^2y “ Pay + 


a'^p 

m^Ml^ 


(i+yf 


= 0 . 


(6.47) 


As expected, there is an exact ACDM solution given by y = 1. Indeed, the two metrics are 
completely equal, g^v = f/iv, because the Bianchi constraint imposes Nf/Ng — daf/dag — 1. 
The second-order polynomial for y in brackets gives two solutions which are inverses of one 
another. This is not surprising, since when g^v f/iv we have y —> y “k 


6.5 Summary of Results 

In this chapter we have presented the main features of the background expansion for massive 
bigravity with matter “doubly coupled” to both metrics through an effective metric, given by 

= a^g^,. + 2a^g^„X"v +P^fi,v, ={y/ g~^fTv (6.48) 

This coupling was introduced in Refs. [73, 163], and has been further discussed in Refs. [161, 
162,172,173]. This matter coupling has several advantages: it retains the metric-interchange 
symmetry in the presence of matter, leads to sensible cosmological solutions, and has a 
straightforward physical interpretation. 

The expansion history is described by a Friedmann equation for the effective metric (6.34) 
and a quartic equation (6.35) which algebraically describes the evolution of y = a f/ag, the 


^ ^Vacuum solutions for this model were previously studied in Ref. [76]. 
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ratio of the /- and g-metric scale factors. One can always choose the parameters of the theory 
such that the background expansion is exactly that of ACDM; any parameter choice which 
leads toy — p/ain equation (6.35) will have this behaviour. For more general parameter 
values, the background expansion will deviate from ACDM but may still be consistent with 
observational data. To explore this, we confronted the models with only ySo, Pi, or P 2 nonzero 
with observational data. The other single-parameter models—with Ps or P 4 nonzero—are 
then automatically included in this analysis due to the duality between solutions under 
giiv ^ fliv, Pn Pi-n, and a<^ p,SLS described in appendix C. 

A novel feature of the effective coupling studied here is that g^y and can be conformally 
related to each other at the background level in the presence of matter. In the singly-coupled 
case, this is only possible in vacuum, where the solutions are de Sitter. A special example of 
this is the parameter choice leading to a candidate partially-massless theory. This potentially 
has a novel gauge symmetry which would eliminate the problematic fifth force and protect a 
small vacuum energy against quantum corrections. In this case the background is identical to 
ACDM in the presence of matter. This suggests that doubly-coupled bigravity is a promising 
candidate for a theory of partially-massless gravity. 

This matter coupling has a problematic feature, namely that loop corrections for any 
matter coupled to g®® will generate all five dRGT interaction terms. Therefore the structure of 
the potential and the mass scale m lose their stability against quantum corrections, which 
had been one of the most impressive features of the singly-coupled theory. We have discussed 
an important consequence of this: while many solutions to the theory accelerate at late times, 
it is no longer possible to unambiguously identify solutions that self-accelerate, as the effective 
cosmological constant at late times can always be identified at least in part with a vacuum 
energy contribution. 

We end with a brief comment concerning our expectations for perturbations around 
these cosmological solutions. We have shown in chapter 3 that the singly-coupled models 
are often unstable for small y. One might hope that these doubly-coupled models will have 
better stability properties: y is always nonzero and can be made to have a large minimum 
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value by tuning yS /a. Moreover, we found in chapter 3 that the yS 2 -only model did have stable 
perturbations in the singly-coupled case, but that model is not viable in the background. As 
we have shown, this model is in excellent agreement with background data if yS /a is not too 
small, so it may provide another avenue for stable cosmological solutions in massive bigravity. 




If the Lord Almighty had consulted me before embarking upon his creation, I should 
have recommended something simpler. 


Alfonso X of Castile 


Cosmological Implications of 
Doubly-Coupled Massive Gravity 



In section 2.1.3 we described a no-go theorem for cosmological solutions in dRGT massive 
gravity, i.e., in the theory where the only gravitational degree of freedom is a massive graviton. 
If the reference metric is taken to be that of Minkowski space, then dynamical flat and closed 
FLRW solutions do not exist; the Bianchi constraint (2.55) restricts the scale factor to be 
constant. This can be avoided by either choosing open solutions or changing the reference 
metric, but the resultant solutions are unstable. Therefore, the search for a viable cosmology 
with a massive graviton has necessarily involved extending dRGT by adding extra degrees of 
freedom (as in the bimetric theory which we have studied in chapters 3 to 6) or by breaking the 
assumptions of homogeneity and isotropy, either in the metric or in the Stiickelberg sector. 

The double coupling discussed in chapter 6 has been shown to avoid both of these no-go 
theorems, opening up the intriguing possibility of obtaining sensible cosmological solutions 
with only a single massive graviton [73,173]. In this scenario, matter is coupled to an effective 
or Jordan-frame metric, 

S'^v = “^S'Mv + 2a^g^„X“v-H^^}7^v, (7.1) 
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where g^v is the dynamical metric, is the Minkowski reference metric, and = (\/g“^J7)^v 
The properties of this effective metric were discussed in some detail in section 6.1. However, 
we remind the reader that the theory with this matter coupling is believed to be ghost-free 
at least within the effective theory’s regime of validity and that the Boulware-Deser ghost is 
absent about FLRWbackgrounds [73,161, 162]. 

In this chapter we explore the basic properties of these newly-allowed massive gravity 
cosmologies. Unusually, the proof in Ref. [73] that the no-go theorem is avoided turns out 
to rely crucially on coupling a fundamental field (in this case, a scalar field) to the effective 
metric. In a standard late-Universe setup where matter is described by a perfect fluid with a 
constant equation of state (or even more generally when w only depends on the scale factor), 
this result does not hold, and FLRW solutions are constrained to be nondynamical, just as in 
standard dRGT. More generally, the pressure of at least one component in the Universe must 
depend on something besides the scale factor—such as the lapse or the time derivative of 
the scale factor—for massive gravity cosmologies to be consistent. This is why fields, which 
have kinetic terms where the lapse appears naturally, are required in order to obtain sensible 
cosmological solutions. Consequently the standard techniques of late-time cosmology cannot 
be applied to this theory. 

While we do not aim to rule out these models, the inability to obtain cosmological solu¬ 
tions with just, e.g., dust or radiation is an unusual feature which makes it difficult to derive 
precise predictions for cosmology, as the nature of the “extra matter” is not presently known. 
These solutions exhibit pathologies in the early- and late-time limits if all matter couples 
to the effective metric, and the scalar field physics would need to be highly contrived to 
avoid these issues. Moreover, the reliance on extra matter, such as a scalar field, which may 
well be gravitationally subdominant and high-energy implies a violation of the decoupling 
principle, in which the low-energy expansion of the Universe should not be overly sensitive to 
high-energy physics. 

The rest of this chapter is organised as follows. In section 7.1 we derive and discuss the 
cosmological evolution equations in this theory. In section 7.2 we elucidate the conditions 
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under which the no-go theorem is violated and dynamical cosmological solutions exist. We 
discuss in section 7.3 some of the nonintuitive features of the Einstein-frame formulation 
of the theory, and how these are resolved in a Jordan-frame description. In section 7.4 we 
study cosmologies containing only a scalar field, and generalise this to include a perfect fluid 
coupled to the effective metric in section 7.5. In section 7.6 we consider an alternative setup 
in which the scalar field couples to the effective metric while the perfect fluid couples to the 
dynamical metric. We conclude in section 7.7. 


7.1 Cosmological Backgrounds 


The Einstein equation with all matter fields coupled to g®® was derived in Ref. [174] (see also 
section 6.1) and can be written in the form^ 


+ m^Yj,-irP„ g“^(X-i)f^^det {a + px) (a(X-i)(%r +PT^''), 

n=0 PI 

(7.2) 

where the stress-energy tensor is defined as usual with respect to the effective metric. 


rpllV _ 




y^S'eff 




eff 

iiv 


(7.3) 


and the matrices are defined in equation (2.41). Let us assume a fiat FLRW ansatz for g^v 

of the form^ 

gij^vdx^^dx'’ = -N^[t]dt^ + a^[t)5ijdx^dx^, (7.4) 


and choose unitary gauge for the Stiickelberg fields, = diag(-l, 1,1,1), so the effective 
metric is given by 


g’^^dx^dx'' — -N'^f^t]dt^ + al^t]5ijdx'‘dx-', (7.5) 


^Our convention is that indices on the Einstein tensor GP'’ are raised with gP’’. 

^Note the differences in notation between this chapter and chapter 6, such as our use of a for the scale factor 
of gyjv rather than of g®®. 
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where the effective lapse and scale factor are related to N and a by 


Neff — aN + p, Uen — aa+p. 


(7.6) 


We will define the Hubble rates for g^v and g®® by 


a 


(X pff 

U — 

(^esNefi 


H= -, 

aN 


(7.7) 


Notice that, because of the inclusion of the lapses in these definitions, these quantities 


correspond to what would be the cosmic-time Hubble rates in general relativity, obtained by 
setting iV = 1 or iVeff = 1. While we need not include the lapse in the definition of H when 
working with diffeomorphism-invariant theories like general relativity or massive bigravity, 
instead choosing to set iV to a convenient value and thereby pick a physically-meaningful 
time coordinate like cosmic time or conformal time, the lack of diffeomorphism invariance 
in massive gravity means that neither the lapse nor the time coordinate has any meaning 
on its own, but will only appear through the comhination Ndt. The time component of 
equation (7.2) yields the Friedmann equation. 



(7.8) 


where p = -goo 7^“ is the density of the matter source. The spatial component of equa¬ 
tion (7.2) gives us the acceleration equation. 




2H ^ ap iVeffa^ff 
Na^ 



where the pressure is defined by p = (l/3)g??rF . Notice that the double coupling leads to 
a time-dependent coefficient multiplying the density and pressure terms in equations (7.8) 
and (7.9). The Friedmann equation for the effective Hubble rate, Hes, can be determined from 
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equation (7.8) by the relation 


//eff=a 


Na 


which follows from equation (7.6). 


Matter is covariantly conserved with respect to 


yeff ^ 


(7.10) 


(7.11) 


from which we can obtain the usual energy conservation equation written in terms of the 
effective scale factor, 

deff A ^ 

P+3-(p + p)=0. (7.12) 

<^eff 

As in general relativity, this holds independently for each species of matter as long as we 
assume that interactions between species are negligible. Finally, we can take the divergence 
of the Einstein equation (7.2) with respect to g^v and specialise to the FLRW background to 
find, after imposing stress-energy conservation, the “Bianchi constraint,” 


m^Mpja^P(a)d = a^al^^pd, 


(7.13) 


where we have defined 

2 62 63 

P(a) = A + —+ ^- (7.14) 

a a'^ 

This can equivalently be derived using equations (7.8), (7.9), and (7.12). The pressure, p, 
appearing in equation (7.13) is the total pressure of the Universe, or, if different species couple 
to different metrics, the total pressure of all matter coupling to g®f(. 

Let us pause to count the number of equations and variables in this system. We have four 
free functions—the scale factor, the lapse, the density, and the pressure—and four equations— 
Friedmann, acceleration, conservation, and Bianchi constraint. Of the four equations, only 
three are independent, much like in general relativity. The remaining freedom is fixed by 
specifying an equation of state. The acceleration equation can usually be derived from the 
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other three, hut unlike in general relativity it is not always redundant: if the Bianchi constraint 
yields d = 0, then the acceleration equation does give new information, and in fact is what 
would be used to determine the lapse [94]. This situation is similar to general relativistic 
cosmology, but with one new variable and one new equation: because we have broken 
diffeomorphism invariance, the lapse cannot be fixed by a coordinate transformation, and 
furthermore the divergence of the Einstein equations leads to a nontrivial constraint. This is 
in contrast to general relativity, where the same procedure results in an identity.^ 

We emphasise that if all matter couples to the same g®® then the expansion history 
inferred from observations is given by aeff and iTeff> for the simple reason that all observations 
are observations of matter (including light). In deriving any cosmological observables, the 
“proper time,” dz = Ne^dt, will play the same role as the cosmic time coordinate in general 
relativity. In particular, t corresponds to the time measured by point-particle clocks, while 
the distance light travels is given by dr = dz/aes[T). Therefore in principle we need only 
know iTeff(<2eff) in order to connect to standard background observables. The coordinate time, 
t, is just the coordinate in which the reference metric, has the standard Minkowski form, 
and has no other physical significance. 

Since g^v and g^^J play the exact same roles as the Einstein-frame and Jordan-frame 
metrics, respectively, in other modified gravity theories, we will use these terms freely. 


7.2 Do Dynamical Solutions Exist? 


In the original, singly-coupled formulation of massive gravity, P —0 and so the right-hand side 
of equation (7.13) vanishes, with the result that a is constrained to be constant. This is nothing 
other than the no-go theorem on fiat ELRW solutions in massive gravity. A nondynamical 
cosmology is, of course, still a solution when a and yS are nonzero, in which case the values 
of a and N are determined from equations (7.8) and (7.9). The question is now under which 

^This is because the Bianchi identity and stress-energy conservation are related to the diffeomorphism 
invariance of the Einstein-Hilbert and matter actions, respectively, but we have now added a mass term which 
does not obey this gauge symmetry, after fixing the Stiickelherg fields. 
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circumstances the theory also allows for dynamical a. 

To begin with, let us assume that p — wp, where w can depend on the effective scale 
factor but nothing else. Assuming that d / 0, equation (7.13) becomes 

m^Mp^a^P{a) — aPwal^p, (7.15) 


and p is a function only of a (or equivalently aeff). To see this, consider equation (7.12) in the 
form 


dlnp 
din a 


■3[l + it'(a)] =0. 


(7.16) 


Integrating this will clearly yield p — p{a). Unless the left-hand side of equation (7.15) has 
the exact same functional form for a as the right hand side (which is, e.g., the case when 
w — -1/3 and Pz — P 3 — O), this equation is not consistent with a time-varying a. The theory 
does therefore not give viable cosmologies using the standard equation of state p — wp, 
where w is constant or depends on the scale factor. 


This conclusion is avoided if the pressure also depends on the lapse. In this case, equa¬ 
tion (7.13) becomes a constraint on the lapse, unlocking dynamical solutions.^ The most 
obvious way to obtain a lapse-dependent pressure is to source the Einstein equations with 
a fundamental field rather than an effective fluid. This was exploited by Ref. [73] to find 
dynamical cosmologies with a scalar field coupled to g®®. We discuss this case in more detail 
below. Therefore, while physical dust-dominated solutions may exist, we must either include 
additional degrees of freedom or treat the dust in terms of fundamental fields. The standard 
methods of late-time cosmology cannot be applied to doubly-coupled massive gravity. 


^Another possibility is that the pressure depends on d. The dynamics would be determined by equation (7.13), 
while the lapse would be constrained by the Friedmann equation. It is unclear whether these would give rise to 
Friedmann-like evolution, and we do not discuss this case any further. 
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7.3 Einstein Frame vs. Jordan Frame 


Before examining the cosmological solutions when the pressure depends on the lapse, it 
behoves us to further clarify the somewhat unusual differences between this theory’s Ein¬ 
stein and Jordan frames. It turns out that the Friedmann equation in the Einstein frame is 
completely independent of the matter content of the Universe (up to ein integration constant 
which behaves like pressureless dust): H[a) always has a predetermined form [see equa¬ 
tion (7.19)]. In the Einstein-frame description, matter components with nonzero pressure 
affect the cosmological dynamics through the lapse, N. Because the lapse is involved in the 
transformation from the Einstein frame, H, to the Jordan frame, Hen, cf. equation (7.10), 
the Jordan-frame Friedmann equation (corresponding to the observable Hubble rate) does 
depend on matter. 


We proceed to demonstrate this explicitly. Regardless of the functional form of p, and 
whether or not it depends on the lapse, for d / 0 the pressure is constrained by equation (7.13) 
to have an implicit dependence on a given by 


p(a) = 




(7.17) 


The continuity equation (7.12) can then be integrated to obtain 

C f Bi , „ ^ 

p(a)=^ -—^— \—a^ + P2a^ + P3a\, (7.18) 

'^eff P^eff V 3 J 

where C is a constant of integration that includes any pressureless dust. Inserting this into 
equation (7.8) we find a generic form for the Einstein-frame Friedmann equation. 


3H^ — (^Co + 3cia ^+3c2a ^-j-Csa , 


( 7 . 19 ) 
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where we have defined the coefficients 



(7.20) 


Notice that the functional forms of pia), p(a), and H\a) are completely independent of the 
energy content of the Universe, except for an integration constant scaling like pressureless 
matter. It is interesting to note that in the vacuum energy case studied in section 6.4.2 
with p„ — [alP)Pn+i, all of the C/ coefficients apart from vanish. In other words, if the 
metric interactions took the form of a cosmological constant for then the Einstein-frame 
Friedmann equation would scale as a“^. 


7.4 Massive Cosmologies with a Scalar Field 

If we include matter whose pressure does not only depend on the scale factor, Ues, then the 
Bianchi constraint (7.13) may not rule out dynamical cosmological solutions. For a pressure 
that also depends on the lapse, equations (7.13) and (7.19) determine H and N, which in turn 
can he used to derive the Jordan-frame Friedmann equation. Because the lapse enters into 
the frame transformation (7.10), the Jordan frame can he sensitive to matter even though, as 
discussed above, the Einstein frame is not. The lapse thus plays an important and novel role 
in massive gravity compared to general relativity. 


As discussed above, lapse-dependent pressures are not difficult to obtain: they enter 
whenever considering a fundamental field with a kinetic term. Consider a universe dominated 
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by a scalar field, with the stress-energy tensor 


= KrXKnX - I + V{x]] 


(7.21) 


where = geff V®® and is the potential for the scalar field. The density and pressure 
associated to x are 

*2 * 2 

P:< = ^ + Vixl- P^ = ^- '^- 22 ) 

The constraint (7.13) now has a new ingredient: the lapse, Nes, which appears through the 
scalar field pressure.® 

One can then use the Bianchi identity to solve for the lapse and substitute it into the 
Friedmann equation to obtain an equation for the cosmological dynamics that does not 
involve the lapse [73]. A simple way to substitute out the lapse is to use the relation, following 
straightforwardly from equation (7.13), 


y2 m^Ml.a^P{a) 


2N: 


eft 


aySa 


2 

eff 


(7.23) 


as the lapse only appears in the Einstein-frame Friedmann equation through x^/lN^^^. This 
explains the result, first noticed in Ref. [73], that after solving for the lapse, the Friedmann 
equation loses its dependence on the kinetic term. Note however that we can also use 
equation (7.23) to solve for the potential, F(/), and write the Einstein-frame Friedmann 
equation in a form that does not involve the potential. Of course, if we were to additionally 
use the continuity equation as discussed above, the Einstein-frame Friedmann equation 
would take the form of equation (7.19) which contains neither the kinetic nor the potential 
term. 


Using equations (7.17) and (7.18) we can find expressions for the kinetic and potential 


^The theory studied in Ref. [73] can be obtained by setting Po = “3, fii —3/2, P 2 = —1/2, and P 3 — O [45]. 
With this parameter choice, the Bianchi constraint (7.13) reproduces eq. (5.8) of Ref. [73]. 
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energies purely in terms of a, 



(7.24) 


(7.25) 


where K = the C/ are defined in equation (7.20), and we have further defined 



(7.26) 


Note that the terms proportional to C include any possible pressureless matter component 
coupled to g®®. This integration constant will always appear when solving the continuity 
equation (7.12). The Friedmann equation is given by the generic equation (7.19). That is, 
we are left with the peculiar situation that the pressure, energy density, and Einstein-frame 
Friedmann equation are completely insensitive to the form of the scalar field potential. As 
discussed above, this lack of dependence on the details of the scalar field physics is illusory; 
the lapse does depend on !/(/) and /, cf. equation (7.23), and in turn the physical or Jordan- 
frame expansion history depends on the lapse, cf. equation (7.10). 

Let us briefly remark on a pair of important exceptions. The no-go theorem forbidding 
dynamical a still applies when there is a scalar field present if either the potential does 
not depend on the lapse (such as a flat potential) or the field is not rolling. Let us rewrite 
equation (7.12) (which is equivalent to the Klein-Gordon equation) as 



(7.27) 


If V[x) is independent of Neff then cannot depend on Neff and, by extension, neither 
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can p — — V[x)- In the specific case of V{x) = const, this is clearly true, and we find 

(X a”|, so p — p[a). Similarly, if the field is not rolling, j = 0, then it is clear from 
equation (7.22) that p loses its dependence on the lapse. 

To conclude this section, when a scalar field is coupled to the effective metric, we avoid the 
no-go theorem and it is possible to have dynamical a, unless the potential does not depend 
on the lapse or the field is not rolling. This result agrees with and slightly generalises that 
presented in Refs. [73,173]. In a realistic scenario, however, we will have not only a scalar field 
but also matter components present. We now turn to that scenario. 


7.5 Adding a Perfect Fluid 

We have seen that the no-go theorem on FLRW solutions in dRGT massive gravity continues to 
hold in the doubly-coupled theory if the only matter coupled to the effective metric is a perfect 
fluid whose energy density and pressure depend only on the scale factor. This complicates the 
question of computing dust-dominated or radiation-dominated solutions in massive gravity. 
One solution would be to treat the dust in terms of fundamental fields. Another would be to 
add an extra degree of freedom such as a scalar field. Its role is to introduce a lapse-dependent 
term into the Bianchi constraint (7.13) and thereby avoid the no-go theorem. 

It is this possibility which we study in this section. In section 7.4 we examined the scalar- 
only case. Let us now include other matter components, such as dust or radiation, also 
coupled minimally to We assume that the density, pm, and pressure, p^, only depend on 
aeff.® We can then write the total density and pressure as 

p = K +V + Pm< 

p = K-V + p^, (7.28) 

®As discussed above and in Ref. [73], in principle any dust or radiation is made of fundamental particles for 
which the stress-energy tensor does depend on the lapse. We introduce this effective-fluid description because 
it is the standard method of deriving cosmological solutions in nearly any gravitational theory and is thus an 
important tool for comparing to observations. 
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so that 




P + p-iPm + Pm) 




p-p-iPm- Pm) 


(7.29) 


Note that equations (7.24) and (7.25) no longer hold, as they were derived without considering 
other matter, hut equations (7.17) and (7.18) are still valid and are crucial. 


We would like to investigate the cosmological dynamics of this model. Rather than 
explicitly solving for the lapse and substituting it into the Friedmann equation for Hen, which 
leads to a very complicated result, we will take advantage of the known forms of K{aen) and 
y(aeff), as well as the fact that Nen only appears in Hen and K through the operator 

d \ d 

^ =(7.30) 
d T A/gff d t 


The physical Fluhhle rate is given hy 


(X pff 

U — 

(^effNen 


ad 

effN^eff 


(7.31) 


Using the chain rule, we can write 

da da dV dy V'y 

^ _ _ A- _ A 321 

dt dV d/ dt (dV/da)' 

where a prime denotes a derivative with respect to /. We also know that / = Nen^/^, giving 


WNeffV^ 

[dV/da) 

which we can plug into equation (7.31) to obtain 

iVr2K 

alf^idV/daen)^' 


(7.33) 


(7.34) 
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This is the Friedmann equation for any universe with a scalar field rolling along a noncon¬ 
stant potential. Every term in equation (7.34) can be written purely in terms of Uefu allowing 
the full cosmological dynamics to be solved in principle. K and d V/daes are given in terms 
of Ueff by equation (7.29) [using equations (7.17) and (7.18)]. V' as a function of Uen can be 
determined from the same equations once the form of 1^(/) is specified. Note that while 
the lapse is not physically observable, its evolution in terms of a can then be fixed by using 
equation (7.10) to find 

m f V' 

__ 2j^ I _ 

Ngff yaaH(dV/daeff) 

where H{a] is given by equation (7.19). 

Assuming that the matter has a constant equation of state, we can use the known forms of 
K{a] and 17(a) to find a relatively simple expression for the Friedmann equation up to V', 

4a^^agff +‘l^iaeff +^P^eff) (7 36) 

^ |^3'i^o+4'i^iaeff + 5'i^2<^eff ~ 

where for brevity we have defined 


(7.35) 


^o = P [a^C + ^^^i + m^Mpj(3a(a^3-^^2))], 

Sfi = -2m^Mpj [a (a^3 - 2^^ 2 ) + , 

^2 = m^Mpj [PPi - , 

^p = -a^P{\ + w)p^. (7.37) 


Notice that the right-hand side is a function of a only. 

Let us examine the past and future asymptotics of these cosmologies, taking into account 
radiation [w — 1/3) in the former and dust (lu = 0) in the latter. At late times, taking Uen —> 00 
in equation (7.29), we find 

j2 mmi[pPi-aP 2 ) 

——>- —T7. -- 7.38 

2 Nl^ 
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vix) 




(7.39) 


We see that the scalar field slows to a halt: !/(/) approaches a constant, while dx/dz, where 
dz — iVeffd t is the proper time, approaches zero. Taking the late-time limit of the Friedmann 
equation (7.36), we obtain 



(7.40) 


Because / approaches a constant Xc at late times, V' — {d V/dx]\x=xc contributes a constant 
to the Friedmann equation. Therefore we find that iTeff generically blows up, which is poten¬ 
tially disastrous behaviour. This implies a violation of the null energy condition. Notice also 
that there is no guarantee that V — -ySim^Mpj/a^yS is within the range of V{x), assuming the 
scalar field potential is not set by gravitational physics. This may lead to further pathologies, 
as the form of V{a) would be inconsistent with large values of Uen- As we discuss below, if V' 
goes to 0 then, depending on the speed at which it does so, iTeff may be better behaved. 

At early times, demanding the existence of a sensible radiation era leads to further prob¬ 
lems. Assuming radiation couples to then pm ~ a~^ with pm = Pm/3. We have, cf. 
equation (7.29), that 2K — p + p — [p^ + p^), but, cf. equation (7.18), p and p do not have 
any terms scaling as steeply as a”ff. Therefore, in the presence of radiation, p^ and p^ pick up 
a negative term going as to exactly cancel out pm and p^, leading to < 0 at sufficiently 
early times. From equation (7.34) we see that this would lead to a negative and hence to 
an imaginary Hubble rate. Equivalently, we can take the early-time limit of equation (7.36) to 
show 



(7.41) 


so that again we see (for a real potential) Hetf becoming imaginary. 

How could these conclusions be avoided? We can reproduce sensible behaviour, but only 
if the potential is extremely contrived. At early times, we would need to arrange the scalar’s 
dynamics so that V' ^oo “before” (i.e., at a later Ues then) K crosses zero. ' We would then 

^The other obvious possibility, having dV/daes reach 0 before K does, is impossible given the forms of K{a) 
and V[a). 
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reach the initial singularity, Heff —> oo, before the kinetic term turns negative.® Moreover, we 
would need to tune the parameters of the theory so that K — 0 happens at extremely early 
times, specifically before radiation domination. At intermediate times, V' would need to scale 
in a particular way to [through equation (7.36)] reproduce aj and during 

the radiation- and matter-dominated eras, respectively. Finally, in order to have Hes —> const, 
at late times, we see from equation (7.40) that we would require V' to decay as We can 
construct such a potential going backwards by setting Hen — ^^acdm in equation (7.36), but 
there is no reason to expect such an artificial structure to arise from any fundamental theory. 
Even then we may still get pathological behaviour: Nes diverges if at some point Hesaen — Ha, 
cf. equation (7.10). 


7.6 Mixed Matter Couplings 

Before concluding, we briefly discuss a slightly different formulation which avoids some, but 
not all, of our conclusions. If we consider a scalar field and a perfect fluid, the avoidance of 
the no-go theorem on FLRW solutions only requires that the scalar field couple to g®®. In 
principle, all other matter could still couple to g^y. In fact, this is the theory that was studied 
in Ref. [73]. This theory violates the equivalence principle in the scalar sector, but is not a 
priori excluded, and will turn out to have slightly better cosmological behaviour. Moreover, 
there is a compelling theoretical reason to consider such couplings: matter loops would only 
generate a g-metric cosmological constant and would not destabilise the rest of the potential. 
However, the scalar field’s energy would still contribute to the cosmological constant for g®® 
and hence to all of the interaction terms unless, for example, this was forbidden by some 
symmetry. A massless scalar would be better behaved in this sense, but as we have shown 
above, such a scalar field will not avoid the no-go theorem because after integrating the 
Klein-Gordon equation, the pressure loses its dependence on the lapse. 

Because the perfect fluid couples to g^v and we derived the Bianchi constraint (7.13) by 

®This proposal has an interesting unexpected advantage: the Universe would begin at finite ^etf, so a UV 
completion of gravity might not he needed to describe the Big Bang in the matter sector. 
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taking the g-metric divergence of the Einstein equation, the constraint will now only contciin 
rather than the total pressure, i.e.. 


m^Ml^a^P{a)d — aPal^p^d. 


(7.42) 


This is the same constraint as in the scalar-only case discussed in section 7.4, so the scalar’s 
kinetic and potential energies have the same forms, K[a) and V[a), as in equations (7.24) 
and (7.25). The physical Hubble rate is now H, which after solving for the lapse is determined 
by the equation® 



(7.43) 


where the C/ coefficients are defined in equation (7.20). Because the scalar field does not 
have to respond to matter to maintain a particular form of p(a) and p[a], we no longer have 
pathological behaviour in the early Universe, where there will be a standard evolution. 
Moreover, as was pointed out in Ref. [173], there is late-time acceleration: as pm ^ 0, 3FP 
which, if positive, leads to an accelerating expansion. 

However, these are not always se//-accelerating solutions. We will demand two conditions 
for self-acceleration: that the late-time acceleration not be driven by a cosmological constant, 
and that it not be driven by H(/), both of which can easily be accomplished without modifying 
gravity. In other words, we would like the effective cosmological constant at late times to arise 
predominantly from the massive graviton. 

Let us start with the first criterion, the absence of a cosmological constant. Recall from 
section 2.1.2 that we can write the dRGT interaction potential in terms of elementary symmet¬ 
ric polynomials of the eigenvalues of either X = \fg~^ or K = I — X, with the strengths of the 
interaction terms denoted by the by-now familiar in the first case and by a„ in the latter. 
What is notable is that ao / y^o: the cosmological constant is not the same between these two 
parametrisations. Terms proportional to ^-g arise from the other interaction terms when 
transforming from one basis to the other. In bigravity there is a genuine ambiguity as to how 


®Using the transformations to the a 2 theory in footnote 5, we recover eq. (5.9) of Ref. [73]. 
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one defines the cosmological constant, and throughout this thesis, because we are concerned 
with cosmological solutions, we have chosen to identify the cosmological constant with the 
constant term appearing in the Friedmann equation for the physical metric. In massive 
gravity with a Minkowski reference metric, however, the presence of a Poincare-invariant pre¬ 
ferred metric allows for a more concrete definition of the cosmological constant.^'* Consider 
expanding the metric as 

g/iv = +2fi^v + hf,ahyfsiq^^^’ . (7.44) 


This expansion is useful because the metric is quadratic in but is fully nonlinear, i.e., 
we have not assumed that is small [49]. In this language, the cosmological constant 
term, proportional to ^J-g, can be eliminated by setting ao = ai = 0. Making this choice of 
parameter, and recalling, cf. equation (2.27), that and are related by [45] 


4 

^„=(4-n)!^ 

i=n 


(_!);+« 
(4- /)!(/ - 


(7.45) 


we find the effective cosmological constant can be expressed in terms of a 2 , 3,4 by 


Aeff— 









-as 



4“ a4 



(7.46) 


Part of this constant comes from the fixed behaviour of the scalar field potential. This 
piece is not difficult to single out: it consists exactly of the terms in equation (7.46) propor¬ 
tional to a/p. Taking the late-time limit of equation (7.25), we can see that)/(/) asymptotes 


to 


vix) 


fleff 

- > 




(7.47) 


Now consider the Friedmann equation in the form (7.8) with, at late times, p ^V. We can 


thank Claudia de Rham for helpful discussions on this point. 

^^Notice from equation (7.24) that, as in the case with a perfect fluid, the scalar field slows down to a halt at 
late times, so there is no contribution from the kinetic energy. 
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define a cosmological-constant-like piece solely due to the late-time behaviour of V given by 


ay 


V a 




(3a2 - 3a3-j-a4). 


(7.48) 



3 p 


Then equation (7.46) can simply be written in the form 




(6a2 — 4a3 -I- a4) -H + A^, 


Aeff— 


3 


(7.49) 


where in the last equality we mention that the residual term is nothing other than m^Po/3, 
which is simply a consistency check. 

The modifications to gravity induced by the graviton mass therefore lead to a constant 
contribution to the Friedmann equations at late times, encapsulated in m^Po/3 (with Qq — 
tti = 0). In a truly self-accelerating universe, this term should dominate A;^,. If it did not, the 
acceleration would be partly caused by the scalar field, and one could get the same end result 
in a much simpler way with, e.g., quintessence. For generic values of a„ and for yS ~ both 
of these contributions are of a similar size and will usually have the same sign. To ensure self- 
accelerating solutions, one could, for example, tune the coefficients so that 3a2 - 3a3 -f a 4 = 0 
(the scalar field contributes nothing to Aeff) or 3a2 - 3a3 -t- a 4 < 0 (the scalar field contributes 
negatively to Aeff), or take yS < 1 (the scalar field contributes negligibly to Aeff). 

7.7 Summary of Results 

One can extend dRGT massive gravity by allowing matter to couple to an effective metric 
constructed out of both the dynamical and the reference metrics. The no-go theorem ruling 
out flat homogeneous and isotropic cosmologies in massive gravity [94] can be overcome 
when a scalar field is “doubly coupled” in such a way [73, 173]. We have shown that this 
result is, unusually, dependent on the use of a fundamental field, such as a scalar field in 
the aforementioned references, as the no-go theorem is only avoided when the pressure of 
the matter coupled to depends on the cosmic lapse function. This lapse dependence 





178 


Chapter 7. Cosmological Implications of Doubly-Coupled Massive Gravity 


is not present for the types of matter usually considered in late-time cosmological setups, 
such as radiation ip ~ a”g) and dust (p = 0), and therefore a universe containing only such 
matter will still run afoul of the no-go theorem. While this may not he a strong physical 
criterion—cosmological matter is still huilt out of fundamental fields—it presents a sharp 
practical problem in relating the theory to cosmological ohservations. Furthermore, if one 
uses a scalar field to avoid the no-go theorem, it cannot live on a flat potential and must he 
rolling. The latter consideration would seem to rule out the use of the Higgs field to unlock 
massive cosmologies, as we expect it to reside in its minimum cosmologically. 

Overall, in principle one can obtain observationally-sensible cosmologies in doubly- 
coupled massive gravity, but either a new degree of freedom must be included, such as a scalar 
field or some other matter source with a nontrivial pressure, or we must treat cosmological 
matter in terms of their constituent fields. Thus we cannot apply the standard techniques of 
late-time cosmology to this theory. 

We have further shown that if dust and radiation are doubly coupled as well—which is 
necessary if we demand the scalar obey the equivalence principle—then the cosmologies 
generically are unable to reproduce a viable radiation-dominated era, and in the far future the 
Hubble rate diverges, rather than settling to a constant and producing a late-time accelerated 
expansion. These pathologies can only be avoided if the scalar field potential is highly 
contrived with tuned theory parameters, or dust and radiation do not doubly couple. In the 
latter case, there is generically late-time acceleration, but for much of the parameter space, 
this is in large part driven by the potential of the scalar field. In those cases the modification to 
general relativity may not be especially well motivated by cosmological concerns. Otherwise, 
the parameters of the theory need to be tuned to ensure that the theory truly self-accelerates. 

It seems that dRGT massive gravity only has viable cosmological solutions—i.e., that 
evade the no-go theorems on existence [94] and stability [100] —if one either includes a scalar 
field or some other “exotic” matter with a lapse-dependent pressure (or possibly a pressure 
depending on d) and couples it to the effective metric proposed in Ref. [73] or goes beyond 
the perfect-fluid description of matter. Even if one includes a new scalar degree of freedom. 
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significant pathologies arise if normal matter couples to the same effective metric. In all 
setups, the need for descriptions beyond a simple perfect fluid makes this theory unappealing 
from an observational standpoint. 

We end with three small caveats. Notice that we have assumed that in unitary gauge 
for the Stiickelberg fields, i.e., choosing coordinates such that = diag(-l, 1,1,1), the 
metric has the usual FLRW form (7.4). However, that form is arrived at by taking coordinate 
transformations of a more general homogeneous and isotropic metric, so that assumption 
maybe overly restrictive.^^ Equivalently, one could consider a more general, inhomogeneous 
and/or anisotropic, gauge for the Stiickelberg fields. 

We also note that if this theory does possess a ghost, even with a mass above the strong 
coupling scale, solutions to the nonlinear equations of motion could contciin the ghost mode 
and therefore not be physical. However, a Hamiltonian analysis showed that the ghost 
does not appear around FLRW backgrounds [73], suggesting that we have studied the correct 
cosmological solutions to any underlying ghost-free theory. 

Finally, as discussed in chapter 6, if one simply gives dynamics to the reference metric, we 
end up with a theory of doubly-coupled bigravity which treats the two metrics on completely 
equal footing and has been shown to produce observationally viable cosmologies. 


thank Fawad Hassan for pointing this out to us. 
thank Angnis Schmidt-May for discussions on this point. 





Part II 

Lorentz Violation 


Einstein was a giant. Elis head was in the clouds, but his feet were on the ground. 
But those of us who are not that tall have to choose! 


Richard Feynman 




The only thing that really worried me was the ether. There is nothing in the world more 
helpless and irresponsible and depraved than a man in the depths of an ether binge. And 
I knew wed get into that rotten stuff pretty soon. Probably at the next gas station. 

Hunter S. Thompson, Fear and Loathing in Las Vegas 


Lorentz Violation During Inflation 



For this final chapter, we move to the early Universe to ask what constraints we can put on 
the violation of Lorentz invariance during inflation. As discussed in section 2.2, we can use 
Einstein-aether theory (ae-theory) [121,122] to model Lorentz violation in the boost sector, i.e., 
while maintaining rotational invariance on spatial hypersurfaces, at low energies. In ae-theory 
Lorentz invariance is spontaneously broken by the presence of a vector field nonminimally 
coupled to gravity. A Lagrange multiplier enforces the constraint that this vector, sometimes 
called the aether and denoted here by be timelike and have fixed norm, 

— ( 8 . 1 ) 

where m is a free parameter with mass dimension 1. This forces the aether to acquire a 
nonzero vacuum expectation value (VEV) at every point in spacetime, so at every point the 
aether picks out a timelike direction and hence defines a preferred reference frame. Note 
that ae-theory is a vector-tensor theory of gravity and is, at the level of the theory, completely 
Lorentz invariant. It is the nontrivial constraint which ensures that Lorentz invariance, 
and specifically boost invariance, is always broken at the level of the solutions. Ai-theory 
corresponds, for example, to the low-energy limit of Hofava-Lifschitz gravity, a well-studied 
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candidate UV completion of general relativity which breaks the symmetry between time and 
space coordinates directly at the level of the action [112], 

In section 2.2.2 we considered a generalisation of ae-theory in which a scalar field, (j), 
couples to the aether by allowing its potential, ), to depend on the divergence or expansion 
of the aether, 0 = [129,183,184]. This is of particular interest for cosmology because 0 

is related to the local Hubble expansion rate: the aether is forced by symmetry to align with the 
cosmic rest frame in a spatially homogeneous and isotropic background [122,126, 127, 133], 
and purely on geometric grounds we find 9 —3mH, with H—d/ais the cosmic-time Hubble 
parameter. The ability to use the expansion rate so freely in the field equations is a departure 
from general relativity and other purely metric theories: in such theories, H is not a covariant 
scalar as it can only be defined in a coordinate-dependent way. Thus, this extension of pure 
ae-theory opens up the interesting possibility of cosmological dynamics depending directly 
on the expansion rate in a way that is not allowed by general relativity or many modified 
gravity theories. 

This coupling also allows the aether to directly affect cosmological dynamics at the level of 
the background. Recall from the discussion in section 2.2.3 that this is not possible in “pure” ae- 
theory as the aether tracks the dominant matter source and hence can only rescale Newton’s 
constant in the Friedmann equation, slowing down the expansion. By coupling the scalar 
field to 0 in the way discussed above, one can obtain qualitative changes in the cosmological 
dynamics, cf. equations (2.87) and (2.88). If we identify this scalar field with the inflaton, the 
coupling to the aether therefore modifies inflationary dynamics. In a simple case, it adds 
a driving force which can slow down or speed up inflation [129]. This theory with another 
simple form of the coupling is also closely related (up to the presence of transverse spin-1 
perturbations) to 0CDM, a dark energy theory in which the small cosmological constant is 
technically natural [116,117]. 

The type of coupling we have chosen—i.e., promoting V[(p] to V16,cj ))—may seem restric¬ 
tive, but it is in fact a reasonably general approach to coupling the aether to a scalar field. Any 
terms one can write down which do not fit in this framework would have mass dimension 
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5 or higher. Such terms would only he relevant at short distance scales and would not he 
power-counting renormalisahle. Therefore, from an effective field theory approach, all the 
operators we wish to include are captured in V{6 up to integration hy parts. We refer 
the reader to section 2.2.2 for a more in-depth discussion of the generality of this type of 
coupling. We will perform our analysis with the important assumptions that (j) drives a period 
of slow-roll inflation and that its kinetic term is canonical, hut otherwise leave its properties 
unrestricted. Hence we consider this theory to he a fairly general model of Lorentz violation 
in the inflaton sector. 

Our aim is to explore the effects of such a coupling at the level of linear perturbations to a 
cosmological background, and in particular to find theoretical and observational constreiints. 
For reasonable values of the coupling between the aether and the inflaton, these perturbations 
are unstable and can destroy the inflationary background. This places a constraint on the 
coupling which is several orders of magnitude stronger than the existing constraints. If 
the parameters of the theory are chosen to remove the instability, while satisfying existing 
constraints on the aether VEY then the effects of the coupling on observables in the cosmic 
microwave background will be far below the sensitivity of modern experiments. 

The remainder of this chapter is organised as follows. In section 8.1 we discuss the 
behaviour of linearised perturbations of the aether and the scalar around a (nondynamical) 
flat background, deriving a stability constraint (previously found by another method in 
Ref. [129]) which provides a useful upper bound on the aether-scalar coupling. In section 8.2, 
we set up cosmological perturbations in this theory; the real-space perturbation equations 
can be found in appendix D. In section 8.3 we examine the spin-1 cosmological perturbations 
of the aether and metric during a phase of quasi-de Sitter inflation. This demonstrates clearly 
the existence of a tachyonic instability, which we explore in some depth. In section 8.4 we 
look at the spin-0 perturbations, finding the same instability and calculating the scalar power 
spectrum. Unusually, isocurvature modes do not appear to first order in a perturbative 
expansion around the aether norm. We give a worked example in section 8.5 which elucidates 
the arguments made for a general potential in the preceding sections, and conclude with a 
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summary and discussion of our results in section 8 . 6 . 


8.1 Stability Constraint in Flat Space 

Before moving on to the main focus of this chapter, perturbations around a cosmological 
background, we briefly examine perturbation theory in flat space. Our goal is to derive a 
constraint on the coupling by requiring that the aether and scalar perturbations be stable 
around a Minkowski background. This will set an upper limit relating the coupling to the 
effective mass of the scalar, 

V^^{0,0]<2cu3Vi,^{0,01 (8.2) 

where, we remind the reader, C 123 = Ci + C 2 + C 3 , and analogously for similar expressions. We 
will find this bound on ^ 0 ,^( 0 , 0 )^ useful when we examine the cosmological perturbations. 
This result complements and generalises a derivation in Ref. [129], which used different 
methods and selected a specific form of V(0, ^), and will take as a starting point the method 
utilised in Ref. [127] for pure ae-theory. 

We assume that the potential is analytic around [0,(p] = (0,0), because if it diverges there 
the aether-scalar stress-energy tensor (2.81) will be nonzero and we cannot have a Minkowski 
solution. We will also assume that V(0,0) is either vanishing or negligibly small; if not, then 
this contributes a cosmological constant term to the stress-energy tensor, and our background 
is (anti-)de Sitter rather than flat. Observations constrain such a term, barring a nonlinear 
screening mechanism, to be very small. ^ 

In fiat space the field equations are solved by a constant-field configuration. 


— [m,0,0,0], 

(8.3) 

CD 

II 

(8.4) 

<p=0. 

(8.5) 


^The scalar field is canonical, coupled minimally to gravity, and not coupled at all to the matter sector, so we 
would not expect any screening mechanisms to he present in this theory. 
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We introduce small perturbations, {5 A, 5^}, defined by 

( 8 . 6 ) 

(8.7) 

( 8 . 8 ) 


A = A -|- 5 A, 
(j) =^ + 5(p. 


Writing the action (2.76) as 



we expand the Lagrangian to quadratic order, 


+ 5i^ + 52if. 


(8.9) 


( 8 . 10 ) 


where 5iS^ and are of linear and quadratic order, respectively. The background and 

linear Lagrangians recover the background equations of motion, leaving us with the quadratic 
Lagrangian, 


dz^ ^ -Cidi,v^d>"vy- Cz{df,v‘"f - Cz,di,v^ + 

-]^d^5(t)d>^54) - + + 2Ve^{%Q)54){di,vn], ( 8 . 11 ) 

whose variation yields the equations of motion of the perturbed variables. From here we drop 
the (0,0) evaluation on the derivatives of the potential (although they remain implicit). The 
5 A equation of motion is 

= ( 8 . 12 ) 
It constrains the timelike component of the aether perturbation to vanish. 


(8.13) 
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Inserting this result into equation (8.11) and splitting v‘ into spin-0 and spin-1 fields^ as 

v'=S‘+N\ (8.14) 

where the spin-0 piece S' is the divergence of a scalar potential (S' = d’Y) and the spin-1 
piece iV' is transverse to S' (^,iV' = 0), we find that the quadratic potential decouples for these 
two pieces, 

daif + (8.15) 


where the spin-0 Lagrangian is 


^(0) = CiS^ - CidiSJd‘Sj - C2{diS‘f - c^diS^djS^ 

+ ]^[5’(i)^-5^idi5(i)dj5(t))-]^[Vee{diS'f + V^^5(i)^ + 2Ve^54){diS^)\ (8.16) 

and the spin-1 Lagrangian is 

£^^^^^CiN^-CidiNjd‘Nj. (8.17) 

We have eliminated the cross-terms between the spin-0 emd spin-1 pieces, and the C 3 term in 
the spin -1 piece, using integration by parts. 

Notice that a consequence of the spin-1 perturbation iV' being divergence-free is that the 
scalar-field coupling does not affect the spin-1 Lagrangian, because (p only couples to the 
aether through 0 = In particular, this allows us to use the constraint 

Ci>0 (8.18) 


from the start. This was derived in pure ee-theory from requiring positivity of the quantum 


^The aether perturbation is in a reducible subgroup of SO(3), so by decomposing v' like this we single out the 
real dynamical degrees of freedom. Note also that, here and throughout this chapter, we will refer to scalar and 
vector modes of the aether as spin-0 and spin-1, respectively, so as not to confuse them with the scalar field 0 
and vector field nt*. 
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Hamiltonian for both the spin-0 and spin-1 fields [127], and is suggested by the fact that 
for Cl < 0 the kinetic terms for S' and iV' in equations (8.16) and (8.17) have the wrong sign. 
Since this was proven to be true for the spin-1 perturbations in ae-theory and they remain 
unchanged in this extension of it, this condition on Ci must continue to hold. 

Finally, we can vary the action with respect to our three perturbation variables—S' , iV' , 
and 5(p —to obtain the equations of motion. 


S' - ^ 0 , 

Cl 2 ci ^ 

(8.19) 


( 8 . 20 ) 

n5(p - Yp^5(p - Ve^diS' ^0. 

( 8 . 21 ) 


In ae-theory (p —0 — V{9, (p ) and both aether equations are simply wave equations with plane 
wave solutions [127], 

S'(k)(xe-'‘'™*^'+'H (8.22) 

(8.23) 


with the propagation speeds for the spin -0 and spin -1 perturbations given by 


( 0)2 _ 

" Cl 

cW2 = l. 


(8.24) 

(8.25) 


The scalar coupling modifies the ae-theory situation in two ways. First, C 123 is shifted by 
I Fee evaluated at (0 = 0 , ^ = 0 ) (remember that implicitly we are evaluating all the derivatives 
of V there, so they are just constants). This is to be expected: the expansion of the potential 
around ( 0 , 0 ) includes, at second order, the term ^VeeSO^ — ^ Veei^iS^Y, which can be absorbed 
into the C 2 term in the (quadratic) Lagrangian by redefining C 2 —> C 2 -H | Fee- We will find this 
same redefinition of C 2 appears in the cosmological perturbation theory. 
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The second change from ae-theory is more significant for the dynamics. When is 
nonzero—i.e., when the coupling between and cj) is turned on—it adds a source term to 
the wave equation for S' (the N' equation is unmodified because neither 0 nor cj) contain 
spin-1 pieces, as discussed above). Similarly, a -dependent source is added to the quadratic 
order Klein-Gordon equation for 5(j). 

The equations of motion for S' and 5^ are those of two coupled harmonic oscillators. To 
simplify these, we move to Fourier space, where the spin-0 degrees of freedom S^(f) = d 
and Sep kit] obey the coupled wave equations (dropping the k subscripts and absorbing ^Vge 
into C 2 ): 


f + Vg^5(i)^Q, 

2 ci 

(8.26) 

5(j) + ik^ + V^^)5(j) - Vg^k^y^O. 

(8.27) 


This system can be diagonalised^ by defining 


f=r-F 


Vg, 


2ci + 


5(p, 


Vg^k^ 


where the co± are defined by 



Under this transformation, the equations of motion are simply 


2Ve%k^ 

Cl 


(8.28) 

(8.29) 


(8.30) 




(8.31) 


5~(p + co^^5~(p — 0. 


(8.32) 


^We thank the referee of Ref. [6] for this suggestion, which simplifies an earlier version of the calculation 
while obtaining the same result. 
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Note that in the limit Ve^ 0 where the two fields decouple, co^ goes to the squared 

frequency of a 5^ mode, and col_ goes to the equivalent for y modes. We see that y and 
5~cj) are noninteracting, mbced modes which reduce to y and 5(p, respectively, in the absence 
of a scalar-aether coupling. 

For stability, we require co± to be real, so that the solutions to equations (8.31) and (8.32) 
are plane waves rather than growing and decaying exponentials. Note that co+ is manifestly 
real, so the 5~(p modes are always stable. It is the aether modes, which can be destabilised 
by the coupling to the scalar, while the reverse is not true. Stability imposes a constraint on 
Ve(py 

Vg^^<2c,c^°^\k^ + V^^l (8.33) 

which, since we would like it to hold for arbitrarily large-wavelength modes (i.e., arbitrarily 
small k), can be written, substituting back in the definition = Cus/ci, 


( 0 , 0 )< 2 ci 23 l^^^( 0 , 0 ), 


(8.34) 


where for clarity we have put back in the — ( 0 , 0 ) evaluation which has been implicit. 
Equation (8.34) constrains the coupling between the aether and the scalar field in terms 
of the aether kinetic-term free parameters (or, equivalently, its no-coupling propagation 
speed) and the effective mass of the scalar in flat space. It agrees with the spin-0 stability 
constraint in Ref. [129] which was derived in a slightly different fashion for a specific form of 
V{d,cj)].^ The Ci are dimensionless, so we might expect them to naturally be ^(1). Assuming 
this, equation (8.34) roughly constrains the coupling I^,^(0,0) to be less than the scalar field 
mass around fiat space. Note that this constraint also implies C 123 > 0,® which is the combined 
constraint from subluminal propagation and positivity of the Hamiltonian of the spin-0 field 


^Technically, the mixed aether-scalar modes which become arbitrarily close to the aether perturbations in 
the limit Vg^^O. 

®Our notation is different than that used in Ref [129] and as a result their constraint looks slightly different. 
They define the aether to be dimensionless and unit norm while we give it a norm m with mass dimensions. 
To compensate for this, their c,- are IGnGm^Ci in our notation. We have checked, translating between the two 
notations, that our constraint matches theirs. 

^Assuming that the scalar field is nontachyonic. 
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in pure ae-theory [127], 


8.2 Cosmological Perturbation Theory 

The goal of this chapter is to explore the impact of the coupling between cj) and on small 
perturbations to a homogeneous and isotropic cosmology. We will be particularly interested 
in a period of slow-roll inflation driven by ^. As has been explored in great depth over the past 
three decades, a scalar field rolling slowly down its potential can lead to cosmic inflation and 
all of the interesting cosmological consequences for explaining the structure of the observed 
universe that follow from it [185]. In this section we set up the linearised perturbation theory 
for the metric and the scalar around a homogeneous and isotropic universe. Recall that 
the background equations of motion, i.e., the Friedmann and Klein-Gordon equations, are 
presented in section 2.2.3. 

8.2.1 Perturbation Variables 

Let us consider linear perturbations about the FRW background (2.83), defined by 

ds^ — a^{z) — ZBidzdx' + [(1 +2^)5 ,7 +2HTij]dx‘dx^^, (8.35) 

so the components of the perturbed metric are 


goo — —<2^(1 +2$), 


goi^-a^Bi, 


gij — a^[{\ +2'^)5ij +2HTij]- 


(8.36) 


Inverting, and keeping terms to first-order we have 


g00 = _a-2(l-2$). 
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gO' = 

-2H‘^]. (8.37) 

Indices on spatial quantities like 5; are raised and lowered with 5ij. The Christoffel symbols 
are (with background parts in bold) 

r«. = (.>^f(1 + 2^) + - 2 5ij + + i2M’HTij + 

r'. = J^S‘ + 5‘'^B[j,k] + ^'5] + H'^, 

r.^ = + + ^ + . - HTjk’\ (8.38) 

where primes denote deriatives with respect to t. We do not reproduce the components of 
the Einstein tensor here; they can be found in the literature [186]. 


The aether in the background has only u° — Imposing the constant-norm constraint, 
u^ui^ — —m^,to first order the aether is given by 

= (8.39) 

where is the spatial perturbation to the aether. With lowered indices we have 


+ Bi). 


(8.40) 


Taking the divergence of equation (8.39) we can find the linearised expansion, 




(8.41) 
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Finally, the scalar field (j) is split into a background piece and a small perturbation, 

(l) — (p+5(p, (8.42) 

where ^ satisfies the Klein-Gordon equation in the background metric. Throughout this 
chapter we use overbars to denote background values. 

8.2.2 Linearised Equations of Motion 

In deriving the perturbation equations we will need to expand V{0,(p] around its background 
value. To second order, assuming 5^ is similar in size to the metric perturbations, we have 

V(0,<^) = V+ VedO + %5(l) + ^ [Vggde^ + %^5(l)^ + 2Vg^5e5(P] +0(50^), (8.43) 

where, per equation (8.41), the linear piece of the expansion is given by 

59 = — (34'' - + V\i) . (8.44) 

In deriving the linearised field equations we will need V{6 ,(j)) and all of its derivatives up to 
first order. 


v{e,(i)) = v+Vg5e + %5(i)+o{5e\ 

Vg{G,4))=Vg + Vgg5G + Vg^5(j) + 0 ( 50 ^), 

V^iG,(j)) = % + + %5G + Oi5e^l 

Vgg[9,(P] = Vgg + VgggSG + Vgg^5 (j) + 0(5 G^), 

Vg^iG,(j)) = % + Vgg^SG + Vg^^S (j) + 0(5 Gf . (8.45) 


The linearised equations of motion in real space are given in appendix D. However, the 
symmetries of the FRW background allow us to decompose the perturbations into spin-0, spin- 
1, and spin-2 components [187]. In particular, because the background variables (including 
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the aether, which points only in the time direction) do not break the SO (3) symmetry on 
spatial slices, these components conveniently decouple from each other. Hence we perform 
this decomposition both to isolate the fundamental degrees of freedom and to make close 
contact with the rest of the literature on cosmological perturbation theory. 

We decompose the variables as 


k 

(8.46) 

k 

(8.47) 

k 

(8.48) 

yi — y^rn]Yi[±m) 

k m=0,\ 

(8.49) 

k m=0,l 

(8.50) 


(8.51) 


k m=0,l,2 


where etc., are eigenmodes of the Laplace-Beltrami operator, + k'^ (see Refs. [127,186] 

for the forms of these mode functions cind some of their useful properties). From here on, we 
will drop the k subscripts. The spin-0, spin-1, and spin-2 perturbation equations can then be 
found by plugging these expansions into the real space equations listed in appendix D. 


8.3 Spin-1 Cosmological Perturbations 

We begin our analysis by focusing on the spin-1 perturbations. The spin-2 perturbations are 
unmodified by the aether-scalar coupling because V[0,(p) only contains spin-0 and spin-1 
terms. The only physical spin-2 perturbations are the transverse and traceless parts of the 
metric perturbation Hjij, or gravitational waves, and they behave as they do in pure ae-theory 
[127]. The spin-0 perturbations, discussed in section 8.4, are more complicated than the 
spin-1 perturbations due to the presence of 5^ modes. The important physical result—the 
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existence of unstable perturbations for large, otherwise experimentally-allowed regions of 
parameter space—will therefore be easier to see and understand in the context of the simpler 
spin-1 modes. 


The only nontrivial spin-1 component of the aether field equation (2.77) is v = i, 
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while the spin-1 perturbations of the stress-energy tensor are 
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where a — [ci 3 + 3c2)m^ is defined in equation (2.75). As a consistency check, these expres¬ 
sions reduce to those found in the literature for a scalar field uncoupled to the aether [139] 
(setting V{d,4))— V(^)) and for ae-theory [127] (setting V{d,4)] —ad^,with C 2 -^ C 2 + a). For 
convenience, from here on we will absorb ^Vgg into C 2 and indicate the change with a tilde. 


e.gv 
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(8.56) 


and similarly for quantities like Gc- While this is convenient notation we should remember 
that Vgg and hence all tilded quantities are not necessarily constant, although they are nearly 
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so during a slow-roll phase.' 

We should first note that due to its direct coupling to the aether, the scalar field does 
source spin-1 perturbations, which is impossible in the uncoupled case as the scalar field 
itself contains no spin-1 piece. In pure ae-theory the spin-1 perturbations decay away as 
a~^ [127]. We may wonder if the scalar-vector coupling can counteract this and generate a 
nondecaying spin-1 spectrum. 

Using the gauge freedom in the spin-1 Einstein equations, we choose to work in a gauge 
where = 0; that is, we foliate spacetime with shear-free hypersurfaces. The i-j Einstein 

equation in the spin-1 case is unmodified from the ae-theory case [127] and gives a constraint 
relating the shift and the spin-1 aether perturbation 


where 

7 = 167rGm^Ci3. (8.58) 

It is tempting to notice the similarities between the i> — i aether field equation (8.52) and the 
0-i Einstein equation (8.54), but this is just hinting at the underlying redundancy between 
the two equations. Indeed, using equation (8.52) to eliminate the scalar field term in equa¬ 
tion (8.54) just leaves us with an identity. This is because, due to the constraint equation 
(8.57), the two perturbations B and V are related, and hence (by the Bianchi identities) these 
two equations have to contain the same content. We choose to use the 0-i Einstein equation 
to derive our equation of motion for the spin-1 perturbations. In this equation, the scalar 
field couples to the vector perturbations of the aether and the metric via ^Vetj,(p'. In the 
quadratically-coupled potential of Donnelly and Jacobson, which we discuss in detail in 
section 8.5, the coupling Vofj, is exactly constant. In general, we will take Vecj) to be constant to 
first order in slow roll. 


nonconstant Vgg requires cubic or higher order terms in the potential. For the quadratic Donnelly- 
Jacobson potential discussed in section 8.5, Vgg is constant and can be freely set to zero by absorbing it into 
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Inserting the constraint into the 0-/ Einstein equation we find 
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(8.59) 


Following Ref. [127], we define ^ to eliminate the first-derivative terms, so equa¬ 

tion (8.59) becomes 


r + cf + (a-^ — ? = 0, (8.60) 

V w Cl 2mc\ j 

where the no-coupling sound speed is the de Sitter propagation speed of the spin-1 
aether and metric perturbations when the coupling to the scalar field is absent [127], 



(1 ~ C 3 /Cl) -t- 
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(8.61) 


The background quantity A is defined by 


, a 

A = - 
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(8.62) 


and vanishes in exact de Sitter space. 

8.3.1 Slow-Roll Limit 

The equation of motion (8.60) for the spin-1 aether and metric perturbations is difficult to 
solve in full generality. It was solved in pure de Sitter space {A — 0) in ae-theory (i.e., in the 
absence of the scalar field) in Ref. [127]. In that limit, equation (8.60) is a wave equation 
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with real frequency, so ^ was found to be oscillatory. Therefore, in ae-theory the spin-1 shift 
perturbation, — decays exponentially,® leaving the post-inflationary universe 

devoid of spin-1 perturbations. To investigate whether the inflaton coupling term will change 
this conclusion, let us solve equation (8.60) in the slow-roll limit. 

We define the slow-roll parameters, s and rj, in the usual way, 

_ H _ 

6 s' 

Ws^Ws’ 

where for completeness we have included both the cosmic-time and conformal-time defini¬ 
tions. Slow-roll inflation occurs whenever s,rj<.l. In this limit, both parameters are constant 
at first order and we can find 


(8.63) 

(8.64) 


—^(l-Hf), (8.65) 

Ht 

1 

-(1 + f). (8.66) 

T 

Taking conformal-time derivatives we can calculate the background quantity defined above, 

(8.67) 

Note that during slow roll, H is approximately constant but is not; therefore, even though 
we are working in conformal time, we will often choose to write the equations of motion and 
their solutions in terms of H, treating it as a free parameter which measures the energy scale 
of inflation. 

Using these relations, as well as the Klein-Gordon equation in the slow-roll limit and the 


®Here and in the rest of this chapter, “exponential” growth or decay should he taken to mean exponential in 
cosmic time, or as a power law in conformal time. 
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fact that — aH,we can write the ^ equation of motion to first order in slow roll as 
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( 8 . 68 ) 


V{6,(j)) and its derivatives will be constant at leading order in the slow-roll parameters, so 
if we ignore G{e) terms then and in equation (8.68) are constants. Our equation of 
motion for the spin-1 perturbations can then be written simply as 


A 




(8.69) 


where we have defined the constant 


6mciH^ 




(8.70) 


We have also assumed that A dominates a£/[m^Ci) ~ £ (which is ~ ^(10“^) [36, 188]), the 
term from pure ae-theory. In principle this need not be true, if the coupling term were 
extraordinarily small. If the aether-scalar coupling is to do anything interesting, then Vg^ 
must be larger than that, so we will continue to assume that it is. 

Finally, let us identify the effective mass of the spin-1 aether perturbation, V — ^/a. 
Writing the equation of motion (8.69) in terms of V, switching to cosmic time, and working to 
leading order in slow roll, we find 

V + 3HV + i2-A)H^V+^—V^0. (8.71) 


Outside the sound horizon, Csk <. a, we find that V has an effective mass in the slow-roll 
regime given by 




6mciH' 


(8.72) 


We expect a tachyonic instability to develop for negative i.e., for A > 2. We proceed to 
demonstrate that just an instability arises. 
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8.3.2 Full Solution for the Vector Modes 


Noticing the similarity between equation (8.69) and the usual Mukhanov-Sasaki equation 
[139], which has solutions in terms of Bessel functions, we change variables to g = 
with X = to recast equation (8.69) as Bessel’s equation for g(x), 

with the order v given by 

9 1 

v^ = -+A. (8.74) 

4 

Depending on the sign and magnitude of A, the order v can be real or imaginary. We will find 
it convenient to write the general solution in terms of the Hankel functions as 

^ [akH^^\-cf"^'*kz) + PkHf\-cf"^^kz)]. (8.75) 


To determine the values of the Bogoliubov coefficients, at and Pk, we need to match this 
solution in the subhorizon limit, — —> oo, to the quantum vacuum state of the aether 

perturbations in flat space. This is desirable because we can assume that, at such short 
wavelengths, these modes do not “see” the cosmic expansion. In section IV.B of Ref. [127] the 
quantum mode functions Nk for the aether perturbation v' were demonstrated to satisfy 


Nk^ ^ (8.76) 

4Vlci|A: 

This function is related to Nk — ^V — The mode Nk is defined in Minkowski space, 
where a = 1 with t = z, so we only need to modify it by a factor of m to obtain Using the 
asymptotic formula 


lim H^^’^\-cf^^kz] 
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(8.77) 
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with 5 — ‘^{v + \/2), we find that in the subhorizon limit, 






(8.78) 


Matching to equation (8.76), and ignoring the unimportant phase factors we see that we 
need 


CCk 


— — 
4m V |ci|’ 


I3k = 0, 


(8.79) 

(8.80) 


where we have (consistently) put in some factors of which do not appear in the fiat-space 
calculation because it ignores gravity, but would have appeared if we had included gravity.® 
Substituting in this value of a^, we find the full solution for the spin-1 perturbation 

= -^V^HW(-cfi^A:T). (8.81) 

As a consistency check, if we turn off the scalar-aether coupling, we have v = 1/2, and (up to 
an irrelevant phase of-7r/2) we recover equation (91) in Ref. [127]. 


8.3.3 Tachyonic Instability 


On superhorizon scales, the Hankel functions behave as 


lim 




-V 


(8.82) 


®To see this, consider equation (8.60) in the case a —I, which is the spin-1 perturbation equation in flat space 
with gravitational perturbations turned on. Since this requires (p —0, the equation of motion (8.60) just becomes 
-f = 0. This has the same solution as we found in the case with gravity turned off in section 8.1, but 

with the sound speed modified, as expected. 
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Plugging this into equation (8.81), we see that the large-scale vector perturbations to the 
aether and metric depend on time as 


^-/^(-t) ’'’~(-t)2 


(8.83) 


When the aether-scalar coupling (proportional to A) is small or absent, such that -1/4 < 
A < 2, the vector perturbations decay and are unobservable, as in pure ae-theory [ 127] . If A 
is outside that range, then the coupling is large enough to change the nature of the vector 
perturbations. The coupling has two possible effects, depending on its sign. If v is imaginary 
(A < -1/4), then the vector modes are both oscillatory and decaying.i° This corresponds to a 
large coupling which significantly damps the perturbations. On the other hand, the vector 
modes will experience runaway growth if 3/2 — v is real and negative, or A > 2. In this case 
the coupling is large, but with the opposite sign to the previous case, and this large coupling 
drives runaway production of aether modes. This is precisely the tachyonic instability we 
anticipated in section 8.3.1, as it results from the aether perturbations acquiring an imaginary 
effective mass. 

Since this growth is exponential (in cosmic time, or in number of e-folds), it seems quite 
probable that this growing vector mode will overwhelm the slow-roll background solution 
and therefore lead to an instability. In this subsection we will calculate the growth of a single 
vector mode and compare it to the background evolution. 

In order to maintain a homogeneous and isotropic background spacetime, the time-space 
term in the stress-energy tensor must be zero at the level of the background if°i — 0). The 
spin-1 perturbations do contribute to these terms in the stress-energy tensor (8.54) through 
terms proportional to In particular, we will focus on the scalar-aether coupling 


term 



(8.84) 


^“Recall that, during inflation, t runs from —oo to 0. 
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which we will write as 


’fO ml/ ■r/(=l=l)'^(±l) 


(8.85) 


While we focus on this term for simplicity, we note that there are many other terms in T°i 
which receive contributions from the vector modes, and some may even be larger than the 
term in equation (8.85). 


Our strategy will be to focus on a single mode, picking one of the larger modes available to 
us. Because 14 grows with decreasing k, we choose a mode which crosses the sound horizon 
at some early conformal time t, . Such a mode has wave number 


A: = 



( 8 . 86 ) 


The perturbation 1^^'*'^^ is given by equation (8.81), which for a superhorizon perturbation 
becomes 

- ^^r(v)2" (-T,)i (8.87) 

where N is the number of e-folds after the mode crossed the sound horizon. 


The mode function is given by [127] 
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[k xn^.T i [k x nj ,j 



( 8 . 88 ) 


where if is a unit vector orthogonal to k. We can always choose three orthogonal coordinates 
such that k‘ — k5\ and n‘ — 5^2, so the mode function is 



l=Fi 

/2 




(8.89) 


This oscillates throughout space; we will choose x such that Re [(i ± l)e'^ ^j has its maximum 
value of 1. (The other terms in 14^'*'^^ manifestly real.) 


Therefore, this particular mode has a contribution to the 0-z component of the stress- 
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energy tensor which includes a term 




a 2n 


4m 


a/2 


(8.90) 


Using the slow-roll equation for SM’cp' — a^l4, we can write this as 


24n^\c,\ 


(8.91) 


Comparing this to the background 0-0 component of Tf^y, T°o — p — 3FP/8nGc, we find 






9 ^ 2 ^^ 


(8.92) 


Using the slow-roll Friedmann equation we could rewrite this purely in terms of the potential 
as 

T°i,k ^ - 1_ T{vW-H-Tif^ (8.93) 


T% 2471 ^^- 0^6 


The key feature here is the exponential dependence on N for v > 3/2 (the condition we 
found above for to grow exponentially in cosmic time). While the derivatives of the 
potential in the numerator of equation (8.93) should be a few orders of magnitude smaller 
than the potential in the denominator due to slow roll, this is likely to be dwarfed by the 
exponential dependence on the number of e-folds, which even for the bare minimum length 
of inflation, N ~ 50-60, will be very large. Moreover, as we will see in section 8.5, v can in 
principle be larger than 3/2 even by several orders of magnitude, hence the other terms with 
exponential dependence on v, as well as the gamma function, can be quite large as well. 


Therefore, when v > 3/2 the vector modes will generically drive the off-diagonal term in 
the stress-energy tensor far above the background density. This does not necessarily mean 
that isotropy is violated. As discussed in section 8.4, the same physical process that drives 
will similarly pump energy into the spin-0 piece, which affects the perturbations to 

as well as 7’°,. Consequently, background homogeneity and isotropy could still hold, but 
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the slow-roll solution to the background Friedmann equations which we perturbed would be 
invalid. Either way our inflationary background becomes dominated by the perturbations. 

Note that this calculation was done for a single mode, albeit one of the largest ones 
available because H grows for smaller k. Integrating over all modes produced during inflation 
would of course exacerbate the instability. 

We will explore this instability in greater quantitative detail in section 8.5, where we 
examine a specific potential for which we can elucidate the constraints on and Vg^. 


8.3.4 What Values Do We Expect for A? 


has an effective mass-squared (8.72) which depends on both the theory’s free parameters 
and derivatives of the scalar potential, and can be of either sign. When it is negative, the 
aether modes are tachyonic and contains an exponentially growing mode. This occurs 
when the parameter A, defined in equation (8.70), satisfies A > 2. To leading order in the slow- 
roll parameters, A is written in terms of several free parameters: Ci,m,H, and the potential 
derivatives Vg^ and 


. VsVed) . 

A = -——^ + 


(8.94) 


6mcitP 

Hence A can span a fairly large range of orders of magnitude. However, there are several 
existing constreiints on these pareimeters, most of which constrain several of them in terms of 
each other. 

There are two things we can do to clarify our expression for A. We generally expect that for 


a slow-roll phase, ^ < 3Hep and where the Planck mass is given as usual by 


M“2 = 87rG = 87rGc-^(l). 


(8.95) 


We will rewrite the second inequality in terms of a slow-roll parameter, as 




(8.96) 
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Using the slow-roll Friedmann and Klein-Gordon equations, we can then rewrite A as 

A = sgn(,^)^[^] + (8.97) 

Next, we can redefine the coupling using the fiat-space stability constraint (8.34) that we 
derived in section 8.1. Let us define a dimensionless coupling a by 


Ug2^(0,0) = 2ci23M2cr, (8.98) 

where = 1^0(0,0) is the effective mass-squared of the scalar around a Minkowski back¬ 
ground, so that the stability constraint is simply 

cr<l. (8.99) 


Therefore, we have 


A = sgn(^)(^^''^cr^''^ 


cf Ve^ Mo 
/^V0^{O,O] H 





( 8 . 100 ) 


The instability occurs when A > 2. Let us first examine equation (8.100) to see the condi¬ 
tions under which it is positive. Most of the terms are manifestly positive. Positivity of the 
Hamiltonian for spin-1 perturbations in fiat space requires Ci > 0 [127].^^ Tachyonic stability 
of the scalar requires Mo to be real and positive. The timelike constraint on the aether requires 
that m be positive as well. Putting this all together, we find 


A = sgn(()i) 
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( 8 . 101 ) 


>0 


implying that in order for A to be positive, (p and the coupling Ve^ need to have the same 


^Hhis was derived in pure ae-theory. However, recall from section 8.1 that the spin-1 modes in flat space are 
unaffected hy the scalar-aether coupling: spin-1 perturbations are by construction divergence-free, so 6 only 
contains spin-0 aether perturbations. Hence Ci must still be positive. 
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sign. This is not difficult to achieve in practice; in the quadratic potential of Ref. [129] (see 
section 8.5 for more discussion), it amounts to requiring that (j) and Vg^ have opposite signs, 
which is true for a large space of initial conditions leading to inflating trajectories. 

Next we need to see under which conditions |A| can be ^(1) or greater. We have assumed 
that the scalar slow-roU parameter ^ is small. In particular, in the absence of the aether, ^ is 
equal to s, which observations constrain to be ~ ^(10“^) [36]. It therefore seems sensible that 
should be small but not terribly small, perhaps ~ ^(10“^) or so. 

Similarly, the scalar-aether coupling, a, is constrained by the flat-space stability of the 
spin-0 modes to be strictly less than 1. However, we do not want to consider couplings so 
small as to be uninteresting, so we may choose the coupling to be as close to cr = 1 as is 
allowed. Therefore, cr^/^ ought to be smaller, but need not be too much smaller, than 1. 

Written in the form of equation (8.100), the value of A is sensitive to how Vg^ and 
differ between a quasi-de Sitter inflationary background and a Minkowski background. In the 
quadratic potential V[0,(p) — + ij,0(p which we discuss in section 8.5, both of these 

are constant, although one could construct inflationary potentials for which this is not true. 
The effective mass of the scalar during inflation, M = should be less than the Hubble 
rate in order to produce perturbations. Putting all this together, we are left with 





( 8 . 102 ) 


<1 <1 <?( i ) 


<1 »i? 


We can see that in order for A to be larger than 2, the aether VEV, m, needs to be at least a few 
orders of magnitude smaller than the Planck scale, m is effectively the Lorentz symmetry¬ 
breaking mass scale. It can therefore be quite a bit smaller than the Planck mass, although if 
it were below the scale of collider experiments, any couplings to matter could displace the 
aether from its VEV and Lorentz-violating effects could be visible. 

There are several experimental and observational results suggesting that m/Mpi should 
be quite small. Here we briefly discuss three strong constraints, arising from big bang nucle¬ 
osynthesis, solar-system tests, and the absence of gravitational Cerenkov radiation, as well as 
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a possible caveat. 

As mentioned in section 2.2, the gravitational constant appearing in the Friedmann 
equations, Gc, and the gravitational constant appearing in the Newtonian limit, Gn, are 
both displaced from the “bare” gravitational constant, G, by a factor that is, schematically, 
1 + C;(m/Mpi)2. The primordial abundances of light elements such as helium and deuterium 
probe the cosmic expansion rate during big bang nucleosynthesis, which depends on Gc 
through the Friedmann equations. Therefore, by comparing this to Gjv measured on Earth and 
in the solar system, can be constrained. Assuming the c, are ^(1),^^ the BBN constraint 
implies m/Mpi < 10“^ [126]. 

Slightly better constraints on Gc/Gjv come from the cosmic microwave background (CMB) 
[189, 190]. The tightest bound, jGjv/Gc - 1] < 0.018 at 95% confidence level, was computed 
using CMB data (WMAP7 and SPT) and the galaxy power spectrum (WiggleZ) in a theory 
closely related to the one described in this chapter, and should hold generally for ee-theory 
at the order-of-magnitude level [117]. These constrain m/Mpi to be no greater than a few 
percent. 

There are yet stronger bounds on m/Mpi through constraints on the preferred-frame 
parameters, ai, 2 , in the parametrised post-Newtonian (PPN) formalism. These coefficients 
scale, to leading order, as Ci{m/M^{f [122, 191]. The observational bounds ai < lO”'^ and 
a 2 < 4 X 10“^ therefore imply m/Mpi < 6 x 10“"^. Recent pulsar constraints on a 12 are even 
stronger than this [192], although they are derived in the strong-field regime and thus might 
not be directly applicable to the weak-field ae-theory results. Similarly, recent binary pulsar 
constraints on Lorentz violation [193] constrain m/Mpi < 10“\ assuming C/ ~ ^(1). 

The strongest constraints come from the absence of “gravitational Cerenkov radiation.” 
Because the aether changes the permeability of the vacuum, coupled aether-graviton modes 
may travel subluminally, despite being nominally massless. Consequently, high-energy 
particles moving at greater speeds can emit these massless particles, in analogy to the usual 

the Ci are dimensionless parameters, this is perfectly reasonable. Note that even if m were order Mpi or 
larger and the constraints discussed in here are actually constraints on the smallness of the c,-, A still depends on 
these parameters as 
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Cerenkov radiation. This emission causes high-energy particles to lose energy, and at an 
increasing rate for higher-energy particles. Among the highest-energy particles known are 
cosmic rays, which travel astronomical distances and hence could degrade drastically due to 
such gravitational Cerenkov effects. Such a degradation has, however, not been observed; this 
generically constrains m/Mpi < 3 x 10“® [124]. 

We should note that these constraints can be side-stepped if certain convenient exact 
relationships hold among the C/, although crucially they cannot all be avoided in this way 
simultaneously without allowing for superluminal propagation of the aether modes [ 122 ]. 
The PPN parameters ai ,2 are identically zero when c^ — O and 2ci = — 3 c 2 . The BBN constraint 
is automatically satisfied by requiring 2ci -|- 3 c 2 + Cs to vanish, as this sets Gc — Gn [126]. Note 
that the PPN cancellations imply the BBN cancellation, though the reverse is not necessarily 
true.^^ The Cerenkov constraints vanish if all five dynamical gravitational (metric cind aether) 
degrees of freedom propagate exactly luminally. This happens when C 3 = -Ci and C 2 — 
Ci/(1 - 2ci) [124]. Note that while a 2 = 0 in this parameter subspace, ai = -8ci(m/Mpi)2, 
which would place a constraint on m/Mpi of order 10“^. It is worth mentioning that the 
Cerenkov constraints on m will also be avoided if the mode speeds for some of the aether- 
metric modes are superluminal. This includes a two-dimensional parameter subspace in 
which the PPN and BBN constraints are automatically satisfied [122]. Whether superluminal 
propagation is acceptable in ae-theory is somewhat controversial. It is a metric theory of 
gravity, so superluminality should imply violations of causality, including propagation of 
energy around closed timelike curves [124,127]. However, this may be seen as an a posteriori 
demand, and some authors (see, e.g.. Ref. [122]) do not require it. 

It is unclear what fundamental physical principle, if any, would cause the C/ to cancel in 
any of the aforementioned ways. Hence it seems to be a fairly general result that m must be 
several orders of magnitude below the Planck scale. If m /Mpi is small enough compared to 
M/H and the other small parameters appearing in equation (8.102), A can easily be above 2 

^^The conditions for PPN and BBN to cancel can be relaxed by including a C4 term which describes a quartic 
aether self-interaction. We have ignored such a term in order to simplify the theory, although like the other three 
terms, it is permitted when that the aether equations of motion are demanded to he second order in derivatives. 
When C4 / 0, the vanishing of ai,2 continues to imply that the BBN constraints are satisfied. 
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and the aether-inflaton coupling runs a serious danger of causing an instability. For a given 
m/Mpi, this places a constraint on the size of the coupling Vg^. We will discuss this constraint 
more quantitatively in section 8.5 for a specific choice of the potential. 


8.4 Spin-0 Cosmological Perturbations: Instability and Ob¬ 
servability 

Let us now consider the spin-0 perturbations. Before getting bogged down in calculational 
details, we first summarise this section. The spin-0 equations are complicated by the addition 
of 5^ modes which add a new degree of freedom. In order to tackle these equations, we use 
the smallness of m/Mpi, discussed in section 8.3.4, to solve the perturbations order-by-order, 
along the lines of the approach in Ref. [116]. At lowest order in m/Mpi, the perturbations 
and 5(f) have the same solutions as in the standard slow-roll inflation in general relativity. 
These can be substituted into the ^ equation of motion to solve for ^ at lowest order, which 
we then substitute back into the $ and 5(p equations at ^(m/Mpi). 

The instability found in the spin-1 perturbations reappears, and occurs in essentially the 
same region of parameter space. We then assume that the parameters are such that this 
instability is absent, in which case ^ is roughly constant. We solve for the metric perturbation 
$ and find that neither its amplitude nor its scale-dependence are significantly changed from 
the standard slow-roll case. In particular, we calculate two key inflationary observables: the 
scale-dependence of the $ power spectrum, Us, and the tensor-to-scalar ratio, r. 

Surprisingly, the first corrections due to the aether-scalar coupling enter at 
Up to first order in m/Mpi, the aether-scalar coupling has no effect on cosmic perturbations 
on superhorizon scales, assuming that m/Mpi is small compared to unity and that the per¬ 
turbations are produced during a slow-roll quasi-de Sitter phase. A corollary of this is that 
superhorizon isocurvature modes, a generic feature of coupled theories, are not produced 
by the aether-scalar coupling up to ^(m^/Mpj). Because of the smallness of m/Mpi, any 
deviations to Us and r caused by the aether-scalar coupling are unobservable to the present 
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and near-future generations of CMB experiments. 

Since the pure ee-theory terms in the perturbed Einstein equations carry two powers of 
(which is proportional to m) and so only begin to contribute at ^(m^/Mpj), we will not 
recover the cosmological perturbation results of pure ae-theory by taking any limits, as we 
only work in this section to ^(m/Mpi). The effects of ae-theory on the spin-0 perturbations are 
mild, amounting essentially to a rescaling of the power spectrum amplitude that is 
and is degenerate with s [127]. 


8.4.1 The Spin-0 Equations of Motion 


In order to eliminate nonphysical degrees of freedom, we need to specify a choice of coor¬ 
dinate system with no remaining gauge freedom. We choose to work in Newtonian gauge, 
where — iT® = 0. The equations of motion are relatively simple in this gauge, and the 
perturbation $ has a simple interpretation as the relativistic generalisation of the Newtonian 
gravitational potential [139]. Hereafter we will drop the spin-0 superscripts. 

The 0-0, 0-i, and i-i Einstein equations, respectively, are 


- Cj 

47rG, [-^'5cj)'-a%5cj)) = [3^^-SnGcCim^k^^ 


+ 8nGcCim^k{V' + ^V) — 8nGc&^kV 

+ 4:nGcmaVe^[^'^-3,^5cj)) 

1 k - 

- {kM’^ — k'l'O = —4>'54> — aAV + Cim^ a~^{ak^)' 

8nG 2 


1 - - 

- Cim^- \- -maVgA,(p'V 

a 2 


4nGc [^'5(p'-a^%5(p) = [3^^ - A) ^ + 3^^'- 2M’4j'- 4/" 


8nGcm^^ 

r 


(8.103) 


(8.104) 


■Cl23k^($-|-^) 


3m" 


-|-47rGc- AVgfjQ (34*^ — 33^$ -H fc V). 

a 

-AnGcma [Vg^i3M’5(p + 5(p') + Vg^^^'5(p'^ 

+ 47iGcm^Vgg^ [3A5(p-^'{341'-3J^^ + kV)] . 


(8.105) 






8.4. Spin-0 Cosmological Perturbations: Instability and Observability 


213 


The off-diagonal i-j Einstein equation, unmodified by the coupling between the aether and 
scalar, gives a constraint, 

k\^ + ^) = Ya-\a^kV)', (8.106) 

where y = IdnGm^Cis was defined in section 8.3. We may eliminate'T and its derivatives by 
the constraint (8.106) and its conformal-time derivatives. 




T'" = 7'(aA:) ^ 







(8.107) 

(8.108) 


where, as for the spin-1 perturbations, we have defined ^ = aV and A = - M’'. Note the 

presence of third derivatives of ^ in the expression forwhich could severely complicate 
the Einstein equations at ^(m^/Mpj). 

Einally, the v = i aether equation of motion is, using equations (8.106) to (8.108), 


r+ 

Cim-^ + ay 


a{l-y]A-^maVe^^'\ 

Cim^ + ay j 




cim^ + a 1 

—k{a^) — 


ma^Va 


Cim^ + ay 


2 Cim^ + ay 




(8.109) 


where, as before, tildes indicate the usual ae-theory constants modified by appropriate factors 
of \Vee. 

We can perform a consistency check by observing that these reduce to 5 T^y for a single 
scalar field in general relativity [139] when the aether is turned off (in the limit m ^ 0), as well 
as 5 Ti^y and the (^-equation of motion in ae-theory [127] in the limit V{0,(p)^ Vitp)- 


8.4.2 The Instability Returns 

To lowest order in the constraint equation (8.106) tells us simply that the anisotropic 

stress vanishes: T' = —Taking this into account, the 0-/ Einstein equation at lowest order in 
m /Mpi is 


{a^)' — ^7iGa4i'54)- 


(8.110) 
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The V = i aether equation of motion (8.109) is, dropping terms of ^(m^/Mpj), 




2mci V Cirri'^ J 


k{a^]' 


a^Ve<j,k 

2mci 


5(p 


( 8 . 111 ) 


where 


qo)2. 


Cim^ + ay 


Cl23 

Cl 


1 + 1 




( 8 . 112 ) 


is the same spin-0 sound speed as in flat space (cf. section 8.1) to first order in m/Mpi. In de 
Sitter space this becomes, using equation (8.110) to replace [a^J with Sep, 




^ _ k 
2mcilPT^~ 


oc \ 7 1 Ved) 

1 -I -^ 1 4nG(p - 


Cim" 


2 mci 


5(p 

(8.113) 

/T-Z 


to lowest order in the slow-roll parameters and m/Mpi. 

Combined with the perturbed Klein-Gordon equation, ^ and Sep obey coupled oscillator 
equations. However, to zeroth order in m/Mp\, the scalar field is unaffected by the aether 
perturbations, so on superhorizon scales 5^ is constant up to slow-roll corrections, resulting 
in the standard nearly scale-invariant power spectrum. This is consistent with the fiat-space 
case discussed in section 8.1, where it was found that the coupling to the aether does not 
destabilise the scalar modes. Taking 5^ to be constant and restricting to superhorizon scales, 
cf^kx < 1, equation (8.113) is solved by 


^ = C+t"+ -f A:5^ 



a \ m Cl 

Cim^J Mpi Vg^Mpi 


where C± are arbitrary constants, and 


(8.114) 




Veepfp 

2mciH^ 


(8.115) 


As with the spin-1 perturbations, the spin-0 piece of V = ^/a can either grow or decay 


^'^The aether coupling will still enter the perturbed Klein-Gordon equation at this order through the potential 
terms. 
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exponentially (in cosmic time). In this case it will grow if 

2mciH^ 


(8.116) 


This is exactly the same as the condition, A > 2, for the spin-1 modes to he unstable. The 
real condition for instability may be slightly different, as A > 2 could violate our assumption 
that m/Mpi is small; however, the additional terms would only change some 

multiplicative factors, and not by orders of magnitude. 

As in the spin-1 case, we can most easily see the effect of unstable aether modes on the 
metric perturbations through the off-diagonal i-j Einstein equation (8.106). If V blows up 
exponentially then so will -I- T*, and the metric perturbations will overwhelm the FLRW 
background. 


8.4.3 The Small-Coupling Limit 


From here on we will assume that the aether perturbations are stable, so that 


%<!>$ 

2mciH^ 


< 2 . 


(8.117) 


This can be further split into two dominant cases, |A| < 1 and A < -1/4. There are regions in 
parameter space which are not covered by these cases, such as A ~ 1, but these are likely to be 
highly fine-tuned as many of the parameters which enter A have no relationship to each other 
a priori. Consequently we should consider various values of A on an order-of-magnitude 
basis. 

|A| < 1 corresponds to the limit where the coupling 11^^ | is small compared to the mass 
scale CimH^/<p. Assuming that the background relations for the slow-roll parameters hold as 
in GR (which we will explore more rigorously in section 8.5 for a particular potential), then we 
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have s = AnGcp^/H^ up to ^(m/Mpi), and this limit can be written as 

\Ve,j,\ Cl m 

H ^/s Mpi 


(8.118) 


In this limit, the term C_t" in equation (8.114) is constant up to slow-roll corrections, as is 
the term proportional to 5^, while C+t"+ decays as t. 

This case should be qualitatively similar to ae-theory as it makes the aether-scalar coupling 
very small. However, we might be worried by the appearance of a in equation (8.114). 
The limit Vg^ 0 does smoothly go to ae-theory. The aether perturbation ^ only appears, to 
&{m/Mpi), in the 0-i Einstein equation. 





(8.119) 


The Vg^ in the 0{m/Mpi) term will cancel out the problematic Vg^ in the solution for Taking 
A ^ 0 and substituting in the solution (8.114), this becomes 


+ M’^!^4:nG(p'5(p 




Cl 



( 8 . 120 ) 


The corrections enter at ^(m^/Mpj) and are negligible for the purposes of this analysis. There¬ 
fore the limit |A| < 1 should only differ from ae-theory at ^(m^/Mpj) < 10“^®. 

It is worth mentioning that for small but finite A there will be new effects on extremely 
large scales, k<Vg^. These may or may not be observable, depending on the scales covered 
during inflation. 


8.4.4 The Large-Coupling Limit: The Evolution Equation 

One interesting case is left: a large coupling with opposite sign to or A < 1/4. We will 
consider this for the rest of this section. However, we should mention that the sign of ^ 
depends on initial conditions, and if this sign condition were not satisfied, then (as discussed 
in section 8.4.2) the aether-scalar coupling would drive a severe tachyonic instability. Hence 
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such a large coupling may not be an ideal part of a healthy inflationary theory. 

In this large-coupling case, both of the terms in equation (8.114) are decaying and we 

are left with 


k5(j) 

$ 

\/47rG 


l + ( 


k5(j), 



( 8 . 121 ) 

( 8 . 122 ) 


where we have dropped the 0{m/M^\) terms in the coefficient of the 5(j) term. Recall that 
equations (8.114) and (8.121) have been derived for superhorizon perturbations in the slow- 
roll limit. Hence we will focus our analysis on superhorizon scales, and while we will leave the 
behaviour of the scale factor unspecified in this subsection, it is worth keeping in mind that 
our results may not be valid in spacetimes that are not quasi-de Sitter. Using this solution for 
we can write the 0-/ Einstein equation to ^(m/Mpi) as 


$'-fJ^$ = 47rG + maVg^^ 5cp. 


(8.123) 


It is an interesting result that we can write the 0-f Einstein equation in geometrical terms as 




(8.124) 


to both zeroth and first order in G{m /Mpi). This does not hold, however, to higher orders, and 
might not hold away from quasi-de Sitter space or on subhorizon scales. 

Next we solve the metric perturbation $ to &[m/Mp\). Our master equation is the sum of 
the 0-0 and i-i Einstein equations, dropping a term which is negligible on superhorizon 
scales. 


8nGa^%5cj)=^" + 6J^f^' + 2{3J^^-A)f^ 

+ 4nGmaVetj, — Scp'^ 
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- 4nGmaVe,j,^(j)'5(j) + ... 


(8.125) 


where we have dropped terms of ^(m^/Mp^) and higher. 

Equation (8.125) becomes an evolution equation for $ after using equation (8.123) to 
rewrite the pieces proportional to 5cj) and 5^' in terms of $ and its derivatives. We can 
also use the background equations of motion to remove % in favour of other coefficients 
appearing in equation (8.125). For this latter step, we start with the background relation 
(using equations (2.87) and (2.88) and assuming (p is gravitationally dominant) 


A — - M"' — AnG ^(p'^ + maVe^cp'^ . 


(8.126) 


Taking the conformal-time derivative, we find (dropping ^(m^/Mpj) terms, as we do through¬ 
out) that 

A' f maVe^\ S” maVe,j,J^ 

2- T7^]^ + - j^ + maVe^^. (8.127) 

Using the background Klein-Gordon equation, we obtain an expression for which includes 
contributions from the aether-scalar coupling. 


■2a^V(j, — I ^ -i-4J^ j cp' + maVe(j, 


(8.128) 


In deriving the previous two expressions we have made use of the assumption that 


Ves _,/n rn Ve^ 

ma^^e -^< 1 . 

(p' Mpi H 


(8.129) 


We can immediately use equations (8.123) and (8.128) to remove 1^ and the 5(p terms 
from equation (8.125). We can also take the conformal-time derivative of equation (8.123) to 
find, using equations (8.123) and (8.128), an expression for 5(p', 


1 X r Y A' 

ATiG(p'5(p' = +\^ -A\^, 


(8.130) 
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where we have dropped the 0{m/Mn) term as 5(j)' only appears in equation (8.125) at that 
order. Using these relations, as well as the definition of A, the sum of the 0-0 and i-i perturbed 
Einstein equations (8.125) becomes 








$ 


maVe^ 








Simplifying, we find the evolution equation for $ to ff{m /Mpi), 


(8.131) 





^'+ I U = 0. 


(8.132) 


This is a surprising result. This is exactly the equation obeyed by ^ in single-field slow-roll 
inflation in the absence of a coupling to any other fields [139]. Coupling to new fields gener- 
ically introduces source terms to this equation, signalling the introduction of isocurvature 
modes. We have therefore shown that, to first order in m/Mpi, the scalar-aether coupling does 
not produce any isocurvature modes on superhorizon scales during slow-roll inflation. 

What would happen if we included higher-order terms? The pure ae-theory terms do 
not change equation (8.132) [127]. This is understandable because the aether tracks the 
background energy density, precluding the production of isocurvature modes. However, we 
have introduced new coupling terms in the Einstein equations at ^(m^/Mpj) and higher 
which could potentially produce isocurvature modes. It is currently unclear whether the 
unusual cancellations that led to the result (8.132) will hold at these orders. 

The solution to equation (8.132) is well-known [139], 


$ = C 





(8.133) 


where C is a constant. The remarkable fact that the 0-/ Einstein equation can be written in the 
form (8.124) to either zeroth or first order in m /Mpi means that to first order, the relationship 
between $ and 5(p is the same as in the case without the aether. Using equation (8.133) to 
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find a~^ia^y and plugging that into equation (8.124), we can determine the constant C, 



(8.134) 


exactly as in general relativity. 

The amplitude of 5^ is determined by quantising it in a (quasi-)de Sitter background on 
subhorizon scales, k » aH, and imposing a Bunch-Davies vacuum state. The variable 5(p 
is coupled to the spin-0 aether perturbations, as discussed in section 8.1, and its dispersion 
relation is modified by /r = Ve^iO, 0). However, the fiat-space stability condition constrains this 
to be less that the fiat-space mass of the scalar, Mq = 0), up to an 0[\) factor. Therefore, 

if the initial conditions are set at scales k » Mq (which follows from k:> a H since Mq H), 
then A:»/i as well, and the scalar at these scales behaves as it does in the case with no aether. 
We therefore see that the scalar and metric perturbations, Sep and are exactly the same as 
in general relativity up to &[m/Mpi). 

8.4.5 The Large-Coupling Limit: CMB Observables 

Let us finally connect these calculations to observations. As mentioned at the beginning of 
this section, the two key inflationary observables currently accessible to CMB experiments 
are the spectral index of the primordial power spectrum, Us, and the tensor-to-scalar ratio, r. 

We have seen that, surprisingly, neither of these will be affected by the aether-scalar 
coupling at ^(m/Mpi). Any new effects must therefore enter at earliest at ^(m^/Mpj). To 
discuss these effects, we split $ into zeroth-, first-, and second-order pieces. 



(8.135) 


Using this expansion, the power spectrum of $ is 



(8.136) 
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The deviation from scale-invariance, Us, is defined by 


Ms - 1 = 


d\nA\ 

dink ’ 


(8.137) 


where the dimensionless power spectrum is 


A 


2 

$ 



(8.138) 


In GR, the deviation from scale-invariance is —2s — r]. Using the results 


dln^Qjj 

dink 

dln^l 

dink 


— -3-2s — r], 

= - 3 -f(n,-1)2, 


(8.139) 

(8.140) 


where (Uj - 1)2 is the spectral index of ^ 2 , and assuming that $2 is not too much larger than 
$gr> the spectral index to second order in m/Mpi is given by 

7 tn $2 

ns-l^-2s-iq+\ — \ ^[2s + iq + ins-l) 2 ]+.... (8.141) 

yMpiJ 

Finally, we consider the tensor-to-scalar ratio, r, defined by 

r = (8.142) 

where is the dimensionless power spectrum of the spin-2 perturbations, Pure ae- 
theory effects contribute a constant rescaling to the tensor spectrum which only becomes 
important at ^(m^/Mpj) [127]. Recall that the coupling between the aether and (j), however, 
has no effect on the tensor perturbations as none of the coupling terms contain spin-2 pieces, 
so the tensor spectrum A^ is unchanged apart from the aforementioned (small) rescaling. 
Therefore, r is modified by a factor 

AL 


r 
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where Tgr is the tensor-to-scalar ratio in the absence of the aether-inflaton coupling. Using 
the expansion (8.135), we find that the corrections to r are small, 


— = 1-2 
^GR 


r 



(8.144) 


What size are the corrections to - 1 and r? As discussed in section 8.3.4, m/Mpi is no 
larger than ~ ^(10“^), barring any special cancellations among the c, . We constructed the 
expansion of ^ so that 4*2 is comparable in size to ^o- We assume that there are no effects such 
as instabilities at ^(m/Mpi) which would cause this construction to fail (the one instability 
that we have found in the spin-0 modes, discussed in section 8.4.2, has been assumed to 
vanish, by making the coupling either very small or of the opposite sign to ^). The Planck 
sensitivity to r is about 10“\ and about 10“^ to - 1 [36,188]. 

We see that the first corrections to $ enter at ^(m^/Mp^). This is constrained by other 
experiments to be a tiny number, placing any coupling between (p and 0, which is not already 
ruled out, far outside the current and near future windows of CMB observability. 

8.5 Case Study: Quadratic Potential 

8.5.1 Slow-Roll Inflation: An Example 

The arguments so far have been made for a general potential V{6,(j)) with only minimal 
assumptions. In order to be more quantitative, we will now look more closely at a particular 
form of the potential for which the inflationary dynamics are known and relatively simple. 

The Donnelly-Jacobson potential [129] contains all terms relevant to the dynamics at 
quadratic order in the fields and is given by 



(8.145) 


A term proportional to 9 would contribute a total derivative to the action and hence be 
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nondynamical (note that the potential enters the Friedmann equation through V—OVe, not V 
itself), while a term proportional to 6^ could he absorbed into C 2 and would only renormalise 
Gc. We take /i > 0 as the theory is invariant under the combined symmetry fj, ^ —{jl and 
(p ^—(p. Any dynamics with /r < 0 can be obtained by flipping the sign of (p. 

This is simply m^cp^ chaotic inflation with an extra force that pushes (p towards negative 
values [129]. In the case where the scalar field has no mass term, cp possesses exact shift 
symmetry, (p ^ (p + const., and this theory essentially becomes 0CDM, a dark energy theory 
in which ji is related to the dark energy scale and, importantly, is protected from radiative 
corrections by the existence of a discrete symmetry [116, 117]. Interestingly, in the special 
case where the aether is hypersurface-orthogonal, this theory also admits a candidate UV 
completion in the consistent nonprojectable extension [113-115] of Hofava-Lifschitz gravity 
[112]. In that case, however, the spin-1 modes we have discussed vanish. This is because the 
aether can be written as the (normalised) gradient of a scalar field corresponding to a global 
time coordinate, so it possesses no spin-1 modes. A similar coupling was also considered in 
Ref. [194]. 

The equations of motion, in conformal time, are 

^^^^^a^[M2cp2 + cp'^a-2y (8.146) 

_ 2(p'^a-^ - 3—(p'] , (8.147) 

3 V a J 

0^cp" + 2^cp'^M^(p+3^mijia. (8.148) 

Normally, we can obtain a slow-roll inflationary solution to leading order by neglecting 
^2 = in the Friedmann equation (8.146) and in the scalar evolution equation 

(8.148). The same applies in this theory; we now briefly justify this. 

A slow-roll inflationary phase requires H to be changing slowly, and for inflation to be 
successful it needs to last at least 50-60 e-folds. This is guaranteed by making sure the 

cannot just drop p" as it contains a term like Hp. It is easiest to drop the second derivative piece from 
the cosmic time scalar evolution equation and then move to conformal time. 
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slow-roll parameters 

_ H _ 

s s' 

irs~Ws’ 


(8.149) 

(8.150) 


are both very small compared to unity. For convenience we will work in cosmic time [t — 
f adz) here. The slow-roll parameters are 
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we can calculate the slow-roll parameters, 
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(8.152) 
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(8.156) 


s — 35 + y5^^^, 


rj — -3A 


652 -I-y 
652 -t-2y 


(8.157) 

(8.158) 


The usual slow-roll conditions, (p^ < and ^ < 3H(j), are equivalent to 5 < 1 and A < 1, 
respectively. We generally expect M <Hm order for the inflaton to produce perturbations. 
As we will see below, both the requirement that inflation end and the stability considerations 
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discussed in section 8.1 impose iJ,< jic- When combined, these conditions imply 7 " < 1. So, 
under these reasonable assumptions on M and ji, in order to ensure f < 1 and rj <. I we 
simply need and ^ < 3H(j) as usual. Note, however, that the usual identifications of 

s and rj in terms of the potential will be changed if the scalar-aether coupling is large enough 
for Y to be comparable to 5^/^. 

In the slow-roll limit, the Friedmann and Klein-Gordon equations are, respectively, 

1‘inGc , 

^ ^ ^^Ma\(j)\, (8.159) 


Notice the appearance of /ic defined above. During slow-roll, it is related to the inflationary 
dynamics by 


M^\d)\ 


(8.161) 


The value of /r/jUc is physically significant because it determines the stability of the slow- 
roll solution. The number of e-folds that inflation lasts tends to infinity as fic, which 
corresponds to exact de Sitter expansion; for /i > the slow-roll solution is unstable and 
grows without bound [129]. We will therefore always consider inflationary solutions with 

There is an additional constraint on ji/jic from the spin-0 stability constraint (8.34). Sub¬ 
stituting the definition of [ic into this gives the constraint 


^ <247rGcm2ci23 


247iGm^Ci23 

l-|-87rGa 


(8.162) 


The same constraint was derived along similar lines in Ref. [129].^® Since C 123 < 1 and a>0 
(see section 8.1, as well as Refs. [126, 127]), this is more restrictive than simply jj, < jic, unless 
m is comparable to, or greater than, the Planck scale—a possibility that seems to be ruled out 

^®As discussed in footnote 5, our action and potential differ from those in Ref. [129] because we give the aether 
units of mass while their aether is dimensionless. Taking the different definitions of c,-, m, and /r into account, 
our constraint agrees with theirs. 
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by experiments, as discussed in section 8.3.4. Since experiments suggest m/Mpi < 10“^, jU//ic 
must be so small that inflationary dynamics would be effectively unchanged by the coupling, 
unless cancellations among the c, conspire to weaken the bounds on m. 


8.5.2 The Instability Explored 


Specialising to the Donnelly-Jacobson potential and using the slow-roll equations (8.159) 
and (8.160), we can write the spin-1 equation of motion (8.69) to first order in the slow-roll 
parameters as 

r + = (8.163) 

with A given by 




M 2 

Ip 


rni? / Mil 

-f sgn{[jL(p]^=^ \l ^ + Ci3 + 3c2 
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-f 0{e). 


(8.164) 




Here, as in section 8.3.4, we have defined the dimensionless coupling a by 


= 2ci23M^(7 = 24nGcm^Ci23(JIJil, 


(8.165) 


so that fiat-space stability of the spin-0 modes implies cr < 1. 

As with the general case, the solution (8.81) to equation (8.163) is written in terms of the 
first Hankel function of order v, where 


2 1 

v^ = -+A (8.166) 

4 

Repeating the analysis of section 8.3.3, we pick a single mode which leaves the sound horizon 
at some conformal time t, , which we could take to be the start of inflation. We pick a mode 
which crosses the horizon early because Vk{T) is largest at small k (with t held fixed), so this 
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is one of the larger superhorizon modes available. We want to calculate the contrihution of 
this mode to the 0-i component of the stress-energy tensor. If it exceeds the background 
energy density, then this would indicate a violation of isotropy and signal an instability in the 
background solution which we found in section 8.5.1. 

Using the slow-roll scalar equation and our expression (8.160) for we find 




We can substitute this directly into equation (8.92) to find one of the terms in the contribution 
that this mode makes to T^i, 



We can now get a more quantitative handle on the argument made in section 8.3.3. Assuming 
V > 3/2, the exponential in (v —3/2)N is likely to overwhelm the other terms within the 50-60 
or more e-folds that will occur after t,-, which we take to be near the start of inflation. While 
several terms in equation (8.168) are likely to be several orders of magnitude smaller than 
unity, including M/H, and possibly M/Mpi, it is unlikely that these could be so 

small as to overwhelm the exponential terms and the gamma function. Hence, for v > 3/2, we 
expect that the slow-roll background solution we found in section 8.5.1 is unstable, rapidly 
dominated by perturbations in the aether field generated by its coupling to the infiaton. 

In section 8.3.4 we found that v can surpass 3/2, even by several orders of magnitude, if 
the aether VEV, m, is suitably small compared to the Planck scale. Armed with a specific form 
for the potential, we now briefly clarify that argument and use it to place constraints on the 
aether-scalar coupling parameter, ji. 

If V > 3/2 then A > 2, where A is defined in equation (8.164). It is not difficult to check that 


requirement for v to be greater than 3/2 in the first place. 
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this is the same as the A we discussed for a general potential, equation (8.70), which we wrote 
in various forms in section 8.3.4. There, we found that for A to be positive we needed to be 
positive. With the Donnelly-Jacobson potential, we have an expression for cp, equation (8.160). 
From there we see that iKp is only positive (assuming [i < Hc) when [icp is negative. We will 
take [1 to be positive and then ask if (p can be negative (the opposite case is trivial, as the 
theory has combined jj, —[jl, cp ^—cp symmetry). This is not at all uncommon, and depends 
only on initial conditions. The dynamics for this inflationary model are encapsulated in 
[(p,(p) phase portraits for a range of fi/iic in Ref. [129]. Per equation (8.165), fi/fic is of order 
(m/Mpi)crF2. Because observations suggest m < Mpi (see section 8.3.4), /i should be very 
small compared to jic even when cr approaches unity. Hence, the phase portrait for /r = 0 in 
Ref. [129] will be very close to the dynamics we are interested in. In the exact ji — O case, there 
are as many inflating paths with (p <0 as (p > 0, because when ji — O, the equations for (p emd 
cp have combined cp ^—cp and cp ^—cp symmetry. The next phase portraits show a tendency, 
increasing with [jl, for inflating paths to live in the (p>0 half of the phase plane. Since /r < jic, 
nearly half of all initial conditions leading to viable inflation have [JLCp < 0. 


Considering each piece in A on an order-of-magnitude basis, and taking sgnifKp) = -1, 
we have 
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(8.169) 
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Evidently, A will be greater than 2 if the smallness of m compared to the Planck scale exceeds 
the (square of the) smallness of the scalar mass, M, compared to the Hubble scale. 
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(8.170) 


where we have assumed that m/Mn <€. 1. While M/H should be small, there are no limits 
on how small m /Mpi should be before the collider scale, and moreover, as discussed in 
section 8.3.4, there are already likely to be stringent experimental constraints on m/Mpi 
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(although these tend to depend on the C/ not cancelling out in particular ways). 

The tachyonic instahility discussed here and in section 8.3 is absent when /i and (p have 
the same sign. In this case, the coupling only serves to dampen aether perturbations. For the 
Donnelly-Jacobson potential, what remains is effectively just m^(p^ inflation. If the signs of /r 
and cp are different, or if we were to demand that inflation be viable for all initial conditions, 
then the absence of this instability puts a very strong constraint on the magnitude of ji, 



(8.171) 


From the background dynamics, we expect + &{m/Mp\) ~ ^(10“^), while the 

absence of gravitational Cerenkov radiation constrains m/Mpi < ^(10“^), in the absence of 
certain cancellations among the C;. Thus the constraint on [i is of the order 


^ < ^( 10 -^). 
tl 


(8.172) 


This should be compared to the previous strongest constraint on /r, the flat-space stability 
constraint discussed in Ref. [129] and section 8.1, 



(8.173) 


8.6 Summary of Results 

We have examined cosmological perturbations in a theory of single-field, slow-roll inflation 
coupled to a vector field that spontaneously breaks Lorentz invariance, looking both to 
explore the effects of such a coupling on inflationary cosmology and to place constraints on 
it. The particular model is Einstein-aether theory, a theory of a fixed-norm timelike vector 
called the “aether,” coupled to a canonical scalar field by allowing its potential to depend on 
the divergence of the aether, 6 — In a homogeneous and isotropic cosmology, 0 is 

related directly to the Hubble rate hj H — 0/3m. This construction allows H to play a role in 
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cosmological dynamics that it cannot in general relativity, where it is not a spacetime scalar. 
Moreover, it is a fairly general model of coupling between a fixed-norm vector and a scalar 
field. In particular, while many couplings can be written down which are not captured by a 
potential V[0,(p], all such terms have mass dimension 5 or higher and therefore fall outside 
the scope of the low-energy effective theory. Similarly, Einstein-aether theory is the most 
general low-energy theory which violates boost Lorentz invariance in the gravitational sector 
[125]. 

Around a slow-roll inflationary background, this theory can possess a tachyonic instability. 
The instability is present if the norm of the aether, effectively the Lorentz symmetry breaking 
scale, is sufficiently small compared to the Planck mass, and the aether-scalar coupling 
is suitably large. In this region of parameter space, assuming a technical requirement on 
the initial conditions, scalar and vector perturbations both grow exponentially, destroying 
the inflationary background. Demanding the absence of this instability for generic initial 
conditions places a constraint on the coupling which is significantly stronger than the existing 
constraints, which are based on the stability of the perturbations around flat space and the 
viability of a slow-roll solution. Hence this constraint is by far the strongest on an aether-scalar 
coupling to date, with the assumption that the scalar drives a slow-roll inflationary period. 

The root of the instability is the smallness of the aether VEV, m, compared to the Planck 
mass. The noncoupled terms in the aether Lagrangian each have two factors of so these 
aether terms will come with a factor of (m /MpiY in the Einstein equations. Terms involving 
two or more 6 derivatives of the scalar field potential will also enter the Einstein equations 
with these factors or higher. However, terms associated with the coupling which only 
has one aether derivative, will only have one power of m/Mpi and so will generically be larger 
(depending on the size of 1^,^) than the other aether-related terms. In the aether equation of 
motion, this coupling term will be a power of Mpi/m larger than the other terms for the same 
reason. When the coupling is sufficiently large, it is exactly this term that drives the instability. 

If the instability is absent, then observables in the CMB are unaffected by the coupling at 
the level of observability of current and near-future experiments: the corrections are smaller 
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than This is due partly to the smallness of the aether norm relative to the Planck 

scale, hut is exacerbated hy the presence of unusual cancellations. Solving for the spin-0 
perturbations order by order in the aether VEV m/Mpi, no isocurvature modes are produced 
at first order. This is unexpected, as isocurvature modes are a generic feature of multi-field 
theories. Stronger yet, the perturbations are completely unchanged at first order in m/Mpi 
from the case without any aether at all. This is largely a result of unexpected cancellations 
which hint at a deeper physical mechanism. 

It is unclear whether these unexpected conclusions hold to higher orders in m /Mpi. 
At several new coupling terms enter the perturbed Einstein equations, equa¬ 

tions (8.103) to (8.105), with a qualitatively different structure to the terms which appear at 
^(m/Mpi). The possibility therefore remains that the isocurvature modes that one would 
expect from the multiple interacting scalar degrees of freedom might re-emerge at this level. 
If they do, they would be severely suppressed relative to the adiabatic modes. 

Beyond perhaps an extreme fine-tuning, there does not seem to be a subset of the parame¬ 
ter space in which observable vector perturbations are produced without destroying inflation. 
Even if such modes could be produced, they do not freeze out on superhorizon scales and are 
sensitive to the uncertain physics, such as reheating, between the end of inflation and the 
beginning of radiation domination. Therefore any observational predictions for vector modes 
would be strongly model-dependent. Nonetheless, it should be stressed that the line between 
copious vector production (that quickly overcomes the background) and exponentially decay¬ 
ing vector production is so thin, as it depends on unrelated free parameters, that there is no 
reason to expect this theory would realise it. 

While we made these arguments for a general potential, we also looked at a specific, simple 
worked example, the potential of Donnelly and Jacobson [129]. This potential includes all 
dynamical terms at quadratic order, and amounts to m^ip^ chaotic inflation with a coupling 
to the aether that provides a driving force. It contains many of the terms allowed for the aether 
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and scalar up to dimension The constraint this places on the coupling /r = 



(8.174) 


is stronger by several orders of magnitude than the the next best constraint [129], 



(8.175) 


It is worth emphasising again the two conditions for our constraint to hold. First, the scalar 
must drive a period of slow-roll inflation. Second, the instability can be avoided if jj, and 
(j) have the same sign. Consequently, the new constraint applies only if we demand that 
inflation be stable for all initial conditions. If such a coupling were to exist, this constraint 
could be seen as a lower bound on m, to be contrasted to the many upper bounds on m in 
the literature. 


i®One could also add a tadpole term proportional to 0 and a term proportional to (p^d. The latter would 
effectively promote the coupling /i to p + const, x 0, so during slow-roll inflation the effective [i would still he 
roughly constant. 






If it be true that good wine needs no bush, ’tis true that a good play needs no epilogue; 
yet to good wine they do use good bushes, and good plays prove the better by the help of 
good epilogues. 


Rosalind, You Like It, Epilogue 


Discussion and Conclusions 



Our concern throughout this thesis has been the use of cosmology as a laboratory for testing 
gravity. In the first part, we focused on the theoretical and cosmological implications of 
endowing the graviton with a finite mass, leading to theories either of massive gravity, in 
which there is a single, massive graviton, or massive bigravity, in which two gravitons, one 
massless and the other massive, interact with each other. We have studied both of these 
variants, while focusing on bigravity as it is a better setting for studying Friedmann-Lemaltre- 
Robertson-Walker cosmologies. In the second part, we turned our attention to theories of 
gravity which violate Lorentz invariance, using the early-Universe inflationary era to place 
constraints on Lorentz violation. Below we will summarise in more detail the problems we 
have sought to address and the results obtained in this thesis, before ending by examining the 
implications of this work for future studies of modified gravity. 


9.1 Problems Addressed in This Thesis 

The expansion of the Universe is accelerating. If we assume that the Standard Model of 
particle physics, perhaps augmented by one or more massive dark matter particles, describes 
the matter content of the Universe, then general relativity and, indeed, our intuitive notions 
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of gravity suggest that the expansion should slow down, as galaxies and dark matter peirticles 
exert their gravitational pulls on one another. To accommodate the surprising acceleration, 
one must either give up on the assumption that we are using the right description of matter, 
and introduce an “exotic” dark energy, or explore the possibility that gravity is modified from 
general relativity at extraordinarily large distances. 

The simplest example of such a modification, which is to simply allow for a nonzero cos¬ 
mological constant, fails in one crucial respect: the cosmological constant receives quantum 
corrections which are many orders of magnitude larger than the value required to explain the 
accelerating Universe. This motivates theories of modified gravity which self-accelerate and 
are technically natural, i.e., in which whatever parameter value and theory structure lead to 
the accelerated expansion are not destabilised by radiative corrections. Ghost-free massive 
gravity, particularly in its bimetric form, appears to be such a theory: the special potential 
structure, necessary to eliminate the dangerous Boulware-Deser ghost, and the small graviton 
mass both seem to be stable against quantum corrections [86] , while bigravity in particular 
can easily accommodate self-accelerating solutions which are in as good agreement with 
background observations as ACDM is [79, 80, 85]. 

The expansion histories predicted by bigravity can be very close to ACDM and to each 
other. To break this degeneracy, it is necessary to go beyond the FLRW assumption and 
consider the evolution of perturbations. In so doing, we can derive predictions for the 
formation and growth of cosmic structure, and thereby open up new avenues of testing tbe 
theory. Tbis is especially important now, as the advent of next-generation galaxy surveys 
like Euclid will allow observations of structure formation to place constraints on modified 
gravity and dark energy comparable to or exceeding those from purely geometric observations. 
Moreover, by expanding our study to linear perturbations we can test the stability of the FLRW 
solutions. Indeed, results in the literature prior to the work undertaken in this thesis had 
found evidence for cosmological instabilities in massive bigravity [82,135,136]. It is therefore 
crucial to undertake a detailed analysis of precisely when cosmological solutions in bigravity 
are and are not linearly stable. 
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The presence of two metric tensors in a theory brings with it significant conceptual 
challenges. In general relativity, the metric tensor is not just a dynamical field which obeys a 
particular equation of motion. It has a deep physical role as the geometry of spacetime. In a 
bimetric theory, where there is a second dynamical metric and in which the action treats both 
metrics on equal footing (up to parameter interchanges), there is a danger of demoting the 
metric from its geometric office. In most studies of bigravity, this issue is swept under the rug 
by only coupling matter to one metric. This can be identified as the physical metric, as matter 
follows its geodesics, while the other is simply a rank-2 tensor coupled to it in order to modify 
gravity. The geometric appearance of this second metric, such as the fact that its kinetic term 
is given by the Ricci scalar, is then something of a coincidence, not a sign that we should 
assign any particular geometric interpretation to this field. These considerations immediately 
raise the question of whether matter can be consistently coupled to both metrics, and, if so, 
whether we must give up on a geometric understanding of gravity in the process. 


Turning towards early times and high energies, Lorentz invariance is one of the most 
fundamental ingredients in the best-tested physical theories, but may have to be given up 
in order to resolve the seemingly-intractable problem of quantum gravity. Lorentz violation 
is very well constrained in the Standard Model [111], but much less so in other sectors of 
physics, such as gravity, inflation, and dark matter. In this thesis we have sought to improve 
our understanding of the first two, by allowing a Lorentz-violating vector field to couple both 
to the metric cind to a slowly-rolling scalar field. Such interactions would be expected to arise 
if physics is Lorentz-violating, barring some symmetry forbidding them, and so an exploration 
of inflationary dynamics and perturbations can provide a strong test of the effects and sizes 
of such couplings. 
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9.2 Summary of Original Results 

9.2.1 Massive Gravity and Bigravity 

Chapters 3 and 4 dealt with cosmological perturbation theory in singly-coupled massive 
bigravity. In chapter 3 we set up the formalism for perturbations and investigated the linear 
stability of FLRW solutions. By employing a carefully-selected gauge intended to ease the 
process of integrating out auxiliary degrees of freedom, we reduced the ten Einstein and 
conservation equations to a system of two equations for the two independent, dynamical 
perturbation variables. At small scales, the coefficients of this system become effectively 
constant, allowing us to determine the eigenfrequencies of the system and thereby identify 
stable and unstable models. Only one corner of bigravity, the infinite-branch model, 
turns out to be linearly stable at all times. 

In chapter 4 we applied our perturbation formalism to observations, studying the evolu¬ 
tion of structure in the subhorizon and quasistatic approximation. This limit is applicable for 
most modes observable on linear scales and is of great utility for deriving testable predictions, 
as it reduces many differential equations to algebraic ones. In bigravity, this allows us to 
directly relate every metric perturbation to the density contrast, 5, which in turn obeys an 
evolution equation that can be solved numerically. Doing this, we obtained predictions for 
structure growth for each one- and two-parameter bimetric model with viable background 
evolution, written in terms of three common modified-gravity parameters: the growth rate 
and index, / and y; the modification to Newton’s constant, Q; and the anisotropic stress, 
rj. Note that the quasistatic approximation, necessary to ensure that time derivatives of the 
fields drop out, under the assumption that they are of order Hubble, is not always applicable 
for unstable models. In these cases, the results can be applied (up to possible changes in 
initial conditions for the growth rate) at later times when modes are stable. The predictions 
hold straightforwardly for the stable infinite-branch model, where we find large devia¬ 

tions from general relativity, which should be easily measurable by Euclid, at all times and 
for all parameters. This provides a rare example of a modified-gravity model which can be 
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unambiguously ruled out against ACDM. 

In chapters 5 to 7 we deal with doubly-coupled bigravity from both observational and 
theoretical standpoints. In chapter 5 we studied a simple example of a theory in which the 
coupling of matter to both metrics removes any notion of an effective physical metric for all 
but the simplest fields. As an example of the new structures which might need to be used to 
describe spacetime in such a theory we showed that the dynamics of point-particles can be 
described in terms of a Finsler manifold, a non-Riemannian structure in which there is an 
effective metric which depends, in addition to the two gravitational metrics, on the particle’s 
own velocity. 

The particular double coupling studied in chapter 5 suffers from the Boulware-Deser ghost 
and therefore should be seen as a toy model, rather than a potentially-realistic description 
of the Universe [72, 73]. An improvement can be made by coupling matter to a specially- 
selected effective metric for which the ghost seems to reappear only at energies outside 
the domain of validity of the theory [73]. This new double coupling formed the basis of 
chapters 6 and 7. In the former, we studied the effects of this coupling in bigravity, deriving 
the Friedmann equations, comparing the simplest models to data, and investigating certain 
special parameter regimes. These models can agree well with observations; if they have 
improved stability properties, they will increase the space of cosmologically-viable bimetric 
models. 

In the latter chapter, we investigated this double coupling in the context of single-metric 
massive gravity. This theory famously suffers from a no-go theorem ruling out dynamical, fiat 
FLRW solutions around a Minkowski reference metric [94]. Other choices of spatial curvature 
or reference metric lead to instabilities [95-100]. The no-go theorem is circumvented by 
the double coupling [73], but we found that there is still a serious problem in comparing 
these theories to observations. The avoidance of the no-go theorem relies crucially on the 
use of fundamental fields as the sources for Einstein’s equations. The usual description of 
matter on large scales by an effective fluid, such as pressureless dust, is insufficient and, in the 
absence of any other fields, leads back to nondynamical cosmologies just as in singly-coupled 
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massive gravity. This can only be avoided either by adding new degrees of freedom, such as 
a scalar field, or by treating dust as a field. We examined in further depth the case in which 
a scalar field is added to the theory, coupling it to the effective metric in order to unlock 
dynamical solutions. If the matter fluid also couples to the effective metric, then significant 
pathologies arise due to the highly-constrained nature of the theory. If it couples instead to 
the gravitational metric, one can indeed obtain sensible late-time acceleration, but for many 
parameter choices it is driven as much by the scalar field’s potential as it is by the massive 
graviton. This is because the avoidance of the no-go theorem requires the scalar field to have 
a potential, and the theory’s constraints force it to evolve along the potential in a fixed way 
throughout cosmic history. 

9.2.2 Lorentz-Violating Gravity 

In chapter 8 we studied Einstein-aether theory, a general low-energy model of boost violation 
in the gravitational sector, during inflation. Einstein-aether is a vector-tensor theory in which 
a vector field, or aether, couples to gravity nonminimally while a Lagrange multiplier imposes 
the constraint that it be fixed-norm and timelike. This spontaneously picks out a preferred 
frame at each point in spacetime. 

If such a field exists, it should also be coupled to a scalar inflaton, unless forbidden by 
symmetries of either field. In the case of single-field, slow-roll inflation, this does not appear 
to be the case. We showed that all operators up to mass dimension 4 coupling the scalar to the 
aether can be included simply by allowing the scalar’s potential to depend on the divergence 
of the aether. This model had previously been introduced as a way of allowing the cosmic 
dynamics to depend directly on the Elubble rate, which is not a spacetime scalar in general 
relativity. 

We derived the cosmological perturbations for Einstein-aether theory coupled in this 
way to a scalar field, and calculated the evolution of spin-0 and spin-1 perturbations during 
inflation. In both cases, sufficiently large couplings lead to a tachyonic instability for nearly 
half of all inflationary trajectories. Demanding the absence of this instability for all initial 
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conditions leads to new constraints on the size of the aether-scalar coupling which is at least 
five orders of magnitude stronger than the previous best constraint. 


9.3 Outlook 

What is the next step for modified gravity? Progress on both the observational and theoretical 
fronts is due to be made over the next decade. Upcoming data from Euclid, the Dark Energy 
Survey, and the Square Kilometre Array, among others, will measure both the geometry and 
the structure of our Universe with unprecedented precision. This will allow for more precise 
tests of modified gravity, especially in the regime of perturbations. Euclid, for example, will be 
able to measure the anisotropic stress, rj, within about 10% if it is not strongly scale-dependent 
[147]. Recall from chapter 4 that the one linearly stable bimetric model, the infinite-branch 
PiPi model, has an anisotropic stress that deviates from GR by at least 30%, and as much as 
50%, at all times. Clearly we are on the verge of entering a new experimental era in modified 
gravity. 

On the theoretical side, much more remains to be done. The dark energy problem has 
not yet reached the status of mass generation in the Standard Model, in which the Higgs 
mechanism was for decades a clear frontrunner solution, or inflation, where a single, slowly- 
rolling scalar field is the consensus best available framework. There is no “theory to beat” in 
modified gravity and dark energy. Observationally, a cosmological constant is simple and fits 
the available data, but its theoretical issues may be too problematic to overcome. The simplest 
extension of general relativity is, in a sense, scalar-tensor theories, but none of these theories 
has emerged as being clearly better than the rest; instead, we have the Horndeski theory, an 
extremely general framework that covers all scalar-tensor theories with second-order (i.e., 
healthy) equations of motion [145]. While massive gravity and bigravity are closely related 
to Horndeski theory—the helicity-0 mode of the massive graviton in the decoupling limit is 
of the Horndeski form—they are ultimately a qualitatively different approach to modifying 
general relativity. So is Einstein-aether, and any number of other theories with, e.g., higher 
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dimensions or nonlocality. None of these has emerged as a front-runner; ideally, we would 
demand self-acceleration, technical naturalness, agreement with observations, and some 
degree of aesthetic virtue. While massive gravity, and its himetric extension in particular, 
checks off many of these boxes, the difficulty in obtaining stable cosmological solutions in all 
but a handful of nonminimal models continues to provide impetus to find something better. 

It is impossible to guess what developments may emerge out of the aether. ^ Science is 
notorious for pulling out its most significant discoveries when least expected and when the 
research of the day seemed not to be leading up to them at all. However, we can make some 
educated guesses as to which directions may soon be extended and, perhaps, provide fertile 
ground for new opportunity. Recently, healthy scalcir-tensor theories beyond Horndeski have 
been discovered [195-197]. The presence of nontrivial constraints means that, while the 
equations of motion contain higher derivatives, the propagating degrees of freedom obey 
second-order equations. As discussed in section 6.4.1, a partially-massless theory of gravity 
would immediately satisfy practically all of the requirements we would impose on a theory 
of modified gravity, as its gauge symmetry would both determine and protect a small value 
of the cosmological constant. Whether a partially-massless theory in fact exists has been a 
topic of intense debate, with no resolution reached to date [76, 178, 179, 181, 198, 199]. If 
a theory is found which possesses the partially-massless symmetry nonlinearly around all 
backgrounds, it would unquestionably be a boon for modified gravity. 

Let us examine some of the possible directions opened up by the work described in this 
thesis. In chapter 3, we found that most of the bimetric models with viable cosmological 
backgrounds suffer from an instability at early times, leaving behind only a rather small and 
nonminimal corner of the parameter space. This means that, for any given mode, linear 
perturbation theory breaks down. It means nothing more and nothing less. At the moment 
this is a technical rather than a fundamental difficulty. While the instability may signal a 
breakdown of the background, it may also be the case that the instability is cured at higher 
orders. It is already known that nonlinear effects make spatial gradients of the helicity-0 mode 


^Pun intended. 
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of the massive graviton shallower; this is the Vainshtein mechanism which eliminates the 
fifth force and reduces physics to general relativity in regions like the solar system [54] . It is 
conceivahle that similar effects will work to cure the large time derivatives of cosmological 
perturbations at higher orders. However, this will require either a study of higher-order 
perturbation theory in bigravity or the development of a more sophisticated treatment of 
nonlinear effects. Going to higher orders is not simple, as the complicated theory even at 
linear order shows. Even then, there is no guarantee that a Vainshtein-like mechanism, or any 
other effect which would cure the instability, would arise at second order. Note that N-body 
simulations, which are commonly used to study modified-gravity effects in highly nonlinear 
regimes, usually rely on the quasistatic approximation which fails precisely in these unstable 
models. Indeed, by taking the quasistatic approximation, one could never have discovered the 
instability: the quasistatic equations, presented in chapter 4, show no sign of the instability 
as the relevant terms are taken to vanish. Instead we needed, in chapter 3, to use the full 
cosmological perturbation equations from the start. To date, only tentative steps have been 
taken towards removing the dependence on the quasistatic limit, and only for a very simple 
class of scalar-tensor theories with few of the complications of massive bigravity [200, 201]. It 
seems, then, that understanding whether the cosmological instability dooms most bimetric 
models and, if not, how to calculate observables will require the development of new methods 
to better understand nonlinear perturbations. 

Throughout chapters 5 to 7 we circled the question of how to construct a healthy, doubly- 
coupled bimetric theory, i.e., a sensible physical theory that treats both metrics on entirely 
equal footing. Two options immediately jump to mind. One is to simply include minimally- 
coupled matter Lagrangians for each metric with the same matter, as we studied in chapter 5 
and was introduced in Ref. [77] . This was later shown to reintroduce the Boulware-Deser ghost 
at arbitrarily-low energies [72, 73]. Another possibility is to couple matter to the nonlinear 
generalisation of the massless mode. 



(9.1) 
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Such a coupling was both introduced and shown to possess the Boulware-Deser ghost in 
Ref. [71]. 

In chapter 6 we discussed the only known double coupling in which the ghost does not 
re-emerge at low energies, introduced in Ref. [73] and derived separately and extended to 
multimetric setups in Ref. [163]. While, as we have shown in chapters 6 and 7, this coupling 
leads to a rich phenomenology, the status of the ghost has been somewhat contentious 
and it is not yet clear exactly at which scale it appears [73,161,162]. The consensus seems 
to be that the mass of the ghost is at least at the strong-coupling scale of the theory, and 
possibly parametrically larger. If the latter, then one could argue that this is ghost-free taken 
as an effective field theory, since the ghostly operators are outside the theory’s regime of 
validity [73,162] . Indeed, it is not hard to conceive of the ghost being cured by higher-energy 
operators which, by definition, are ignored below the strong-coupling scale. However, even if 
the effective theory is healthy, the nonlinear massive gravity we are using might not correctly 
describe it.^ Consider, as a very simple example, the fourth-order equation of motion 

e'x +x + co^x — 0. (9.2) 

This has a ghost, by virtue of Ostrogradsky’s theorem, and if the limit e 0 is taken in the 
equation of motion, the ghost disappears as the equation becomes second-order. However, if 
we take the same limit in the solutions to equation (9.2), we do not obtain the correct solutions 
to the second-order equation of motion; the extra two modes do not decouple from the 
theory.^ For more details on this example, we refer the reader to section 3 of Ref. [51]. 

It is therefore not clear whether the presence of a ghost in doubly-coupled massive gravity 
and bigravity, even if we assume it is not excited at the energy scales for which we are solving, 
would lead to the same solutions as the ghost-free low-energy theory would. This should 

2We thank Angnis Schmidt-May for helpful discussions on the following points. 

^More precisely, for nonzero k the frequency of the additional modes goes to infinity in the limit e ^ 0, 
and the actual value of the limit is not well-defined. In a theory where all modes have positive energy, infinite- 
frequency modes are impossible to excite with finite energy. An infinite-frequency negative-energy mode, which 
is what we have here, would however become even easier to produce [51] . 
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not be a problem witb tbe FLRW solutions, for wbicb tbe absence of tbe gbost at all scales 
bas been proven [73] , but is a sign that we need to continue to search for a doubly-coupled 
theory which is truly free of the Boulware-Deser ghost. At present it is unclear whether such 
a theory exists and, if so, what form it will take. If the coupling is not defined by minimally 
coupling matter to an effective metric, then the problems and potential solutions discussed 
in chapter 5 may turn out to be quite relevant. 

However, a bird in the hand is worth two in the bush, and if we momentarily set aside 
our pining for a fully ghost-free doubly-coupled theory, we will notice that we have a double 
coupling, discussed in chapter 6, which at the very least is good for cosmological solutions. Cir¬ 
cumstantial evidence suggests that the new matter coupling should allow for cosmologically- 
stable models in a much larger region of the parameter space than the singly-coupled theory 
does. Recall from chapter 3 that the instability which plagues most singly-coupled models 
appears specifically for small y. This is a problem in any finite-branch model (and most 
models only have viable solutions on the finite branch) because the quartic equation (2.58) 
requires y = 0 at early times. We have seen that in the doubly-coupled theory, y starts at yS /a 
and hence is always nonzero if yc ^ 0. While the modified Einstein equations will change 
the perturbation behaviour, if the rule of thumb that instabilities occur for small y holds, 
then double coupling should open up many more stable models. Moreover, recall, cf. equa¬ 
tion (3.32), that the P 2 model is always stable in singly-coupled bigravity. The problem with 
this model is that it is ruled out by observations and theoretical conditions at the background 
level. In the doubly-coupled theory, the P 2 model has an acceptable background as long as 
yS /a is above a threshold value, cf. figure 6.2. Finally, in one simple example, when y = y^ at 
all times and as a result the metrics are proportional, we know that the perturbations must be 
well-behaved. We found that the effective Friedmann equation in this case reduces to that 
of ACDM. It can, moreover, be shown that for any solutions to the doubly-coupled theory in 
which g^v and are related by a conformal factor, the theory reduces to general relativity,^ 
and that this equivalence to general relativity extends to linear perturbations [174]. This 


^In particular, matter couples only to the massless spin-2 field. 
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implies that the perturbations around the conformal cosmological solutions we have found 
must be the same as in general relativity, and hence are stable. A full investigation of the 
cosmological perturbations in this theory is therefore very well motivated in the search for 
cosmologically-viable models of bigravity. 



We are all in the gutter, but some of us are looking at the stars. 


Oscar Wilde, Lady Windermere’s Fan 




Appendices 


God does not care about our mathematical dijficulties. He integrates empirically. 


Albert Einstein 




A 

Deriving the Bimetric Perturbation 

Equations 


In section 3.1.1 we presented the full linearised Einstein and fluid conservation equations for 
massive bigravity in a general gauge and without making a choice of the time coordinate (i.e., 
with a general lapse). Since these equations are arrived at by a fairly lengthy calculation, in 
this appendix we detail their derivation. 

The perturbations of the Einstein tensor are standard and can be found in, e.g.. Ref. [139]. 
In order to calculate the fluid conservation equations we only need to know the linearised 
Christoffel symbols. For the g metric, these are 


n N I . 
00 ^ 2 ^ 


a 


r" -E, + -R 


0/ M 2 ^ N ^ 








H{1 +Ag) + -Ag — HEgj 5ij + -didjBg + HdidjBg 


a 

N' 


did jpg 


TL^—d^ I -—E, + F, + HF, 


“ a'^ \2a^^ ^ 




';)i^‘j^i<:^sE5‘kdjAg 5jkd‘Ag + d‘djdkBg^ ^djkd'Fg. 


(A.1) 
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Note that in background, only rJJo, and are nonzero. Similarly we can find the /-metric 
Christoffel symbols, fyp, 

XI. 

00 X 2 ^ 

~n ^ Tl Y 

K, = ^\^E, + -F, 

f°, = ^ [K{l+A[)+\A,-KE^S:i + \d,diBi + Kd,diB! 

X,,}\X . A 

^•o«=fS‘\^--Ef+Ff + KFfj 

= + S’l + la'SjBf 

f]t=\ {S\iatAf + 5‘tStAf-5ttS'Af + S‘d,dtB,) - j5,tS‘Ff. (A.2) 


The bulk of the work lies in calculating the perturbations of the mass term. We will focus 
on deriving the linearised field equations, i.e., calculating the matrices rather than the 
second-order action. The metric determinants to linear order are 

detg = -N^a^ {l + Eg+3Ag+X^Bg^ , 
detf^-X^Y^[l + Ef + 3Af + V^Bfy 

The matrix X = ^g~^f is defined in terms of the two metrics as 

X%X^ = g'^^/pv. (A.5) 

Its background value is simply 

X%=x, 

X'j=y5/'. (A.6) 


(A.3) 

(A.4) 
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Using this we can solve equation (A.5) to first order in perturbations to find 


— X I 1 + ~Ai? I, 


X°,- = 


2 

1 Y 




x+yN 

X'o = T7^- (y^'F/-x^'Fg), 


X +y a 


l + ^AAj 5‘j + ^d^djAB 


(A.7) 


The trace of this is 


[X] — X ( 1 + I +y 




(A.8) 


Similarly we can solve for the matrix Y = although we do not write its components 

here as they can be found by simply substituting [N, a, g] with [X, Y, f] and vice versad 

We now need the matrices X^ and X^ and their traces in order to compute the matrices 
appearing in the mass terms of the Einstein equations. For X^ we find 


(X2)%=x2(1 + AF), 

(X^f / = ^ {y^iPg - xdiFf] , 

(X2)'o = ^ (y^‘F/-x^'Fg), 

(X^)^ =y2 [(1 +AA)5^j + d^djAB] , (A.9) 

with trace 

[X^] =x2(l + F/-Fg) + y2 [3(1 + AA) + V2AB] . (A.10) 

X^ is given by 

(X3)% = x^(^l + ^AFj, 

fit may also be calculated explicitly or by using the fact that Y is simply the matrix inverse of X, which can 
be easily inverted to first order. 
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N 

a 


(X%-=y^ 


(^l + ^-AA^y + ^-d^djAB , 


(A. 11) 


with trace 


[X=^]=x^(^l + ^AF)+y^ 


3 l^l + ^AA^ +^V2AB 


(A. 12) 


and can be determined trivially from these. 

Having calculated these we can determine the matrices g~^f) and 

Two helpful intermediate results are 


1 

6 


- ([X]2- [X^]) =y2 [3(1+AA) + V''AB] 


+ xy 


3 I 1 + -(AA + AF)J T-V^AB 


f 3 1 

([X]=^-3[X][X2]+2[X=’]) =y3 IT-AAT-V^AB 


Txy'" 3 1+AA + -AF +V^AB 


(A. 13) 


(A. 14) 


To obtain those intermediate results and the 0-0 and i-j components of the Y matrices, it 
saves a lot of algebra to write the traces, 0-0 components, emd i-j components of the various 
X matrices in terms of 


Cl = X, C 2 = 3y, 

5i = ^AF, 52^]-AA + Iv^AB, SXj^dd'dj-lsYvAAB. (A.15) 
2 2 6 yz 6 J 

Finally, the matrices i[|^)v(\/ g~^f) defined in equation (2.41) are given by 


• n = 0: 

tSv(vYT)=s'' 


(A. 16) 
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• n — \\ 


-y 


‘;;k,(\/s^) ' ^ 


3 I l + ^AAj +^V2A5 


x+y a 


{ydiFg-xdiFfY 

[yd‘Ff-xd‘Fg], 


Y‘,j{/Fy)=-x[i+\AEy'j 

l + iA/ll 6‘i + \[6‘,V‘-0‘Sj)AB 


• n — 2: 


^^2)0 =y' [3(1 + AA) + V2A5] , 


[FFF]=■ 


x+yN 
2y X 


(vFF) =y‘ [d + A^)5'j +1 (S‘jV^-0‘0j) AB 


+ 2xy 


1 + ^(AA +A£)j S‘j + ^ [S‘jV^ - AB 


• n — 3: 


^3)o(^F^)=-y 

4(v^)= " 

^3)o(^F^) = 


3 1 , 

1 + -AA + -V^AB 
2 2 


2 Y 


x+y N 

y^ X 


X +y a 


{ydiFg-xdiFf^, 

[yd‘Ff-xd‘Fg], 


Y^^.[/^)=-xy^ (^l+AA + UEyY + ^{5YV^-d‘dj)AB . 


(A. 17) 


(A. 18) 


(A. 19) 


Plugging these into the field equations, (2.39) and (2.40), we obtain the full perturbation 
equations presented in section 3.1.1. 


































B 

Explicit Solutions for the Modified Gravity 

Parameters 


As discussed in section 4.2, the modified gravity parameters Q and rj in singly-coupled massive 
bigravity have the Horndeski form, 


T] 

Q 


ho 


hi 


1 + A:2h4^ 


l + k^hsj’ 

(B.l) 

1 + k^hA 


l + k^hs J ’ 

(B.2) 


while the growth of structures can be written in these terms as 


'6 + H5- 


1Q c- 

2 n M2 

' g 


(B.3) 


Hence the five hi coefficients allow us to determine all three modified gravity parameters 
we consider in chapter 4. They are given explicitly by 




1 

i+y^’ 


_(l +y^) (^1 +3^33/"^ + (6^2 — 2^4) y2 -y 3 (^1 — ^ 3 )y^)_ 

—Pi + (3^2 ~ A) y^ + ^ 9^3) y* + (3^0 “ 15^2+2^4) y 2 + (3^3 — 7^1) y 2 


(B.4) 

(B.5) 
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hs — —2 (4^2^3 ~ 2^3^4) y® + 3 (2^1 — 3 ^ 3 ) + (4^o^3 — 19^2^3 “ + 2 ^i^ 4) y^ 

”61 +y L 

+ (“3^^ — 18^2 ^ 9^3 +4^0^2 +2^2^4) y^ + 3 ((^0 “ 3^2) ^3+^1 (A ^ 9^2)) y^ 

+ (“7^4+2^3^!+2 (^0 “ 3^2) ^2) y “ ^1 (^0+ ^2) > (B.6) 

2^3^4y® + 2 (^3^2 ~ A ^2 ~ 3 (^1 — 2 ^ 3 ) ^3^ y® + (^1 ( 6^2 ~ 4 ^ 4 ) +3^3 (—2^o + 9^2 + A)) 

+ 2 [^3^^ — 2^3^! +18^1 + 9^3 — 3^o^ 2 ^ 3^2^4) y^ + (37^1^2 + 27^3^2 ^ 9 ^o^3 ^ 9^i^ 4) y^ 

+ 2 ^10^^ — 3^3^i — 3 (^0 — 3^2) ^2] y+3^1 (^0+ ^2) > (B.7) 

^5 = - ^^ + ^3 (24^4^2^9^2_g^2j^jj^ 10 

flQ flj [ 

+ 2 [^18^2 “ 12^4^2 (^^^^3 ~ 96^1^3 + 2 ^^j + 6^3 ( 4^1 + 5 ^ 3 ) ^ 4^2 

+^3 (99^3^ - 81^1^2 + 18^f^3 -3^2^3 - 12 ^oAA - 8 ^i^ 4 )) y' 

- ( 72^3 W2 - A) Pi + (-72^1 + 72^^! + ( 117^1 - 16^2 j + p2 ^gg^^ _ 

+9^3 ^ — 60^2 + ^PoPI 4~ 12^4^2 ~ 96^3 ^2 + 19^0^3 4~ ^PlPij ) y 

- 2 [sePsPl - [b^Pl - 36yS4yS2 + 69yS| + 8y8f) Pi 

“2^3 ^123^2 ~ 76^4^2 + 3 ( 13^1 + ^4 ( 4^0 + ^ 4 )]] Pi 

-3 (72y32^ - SePiP^ + {255P^ + ^Pl) Pi -2\plPiP2 - 9Pl + ^PlPl 
yPa (~12^2 '^^PiPl ~ ^2>PlP2 + 'iPlpPjy^y^ 

+ (24 (3^2 - 2^4) Pi + P 3 i-72Po + 507^2 - 77Pa) PI 

+ [876PI - 5 O 8 P 4 PI + 600Pi P 2 + ^8PIP2 - 3 Pi Pi + Po (-72yS| + 48PiP2 - 69 Pi) ) Pi 
+3^3 (24^2^0 4~ ly-228PI + 16^4^2 + 9^3 j ^0 + 9^2 (48^2 ~ ^PaP 2 ~ 7PI))) y^ 

+ 2 [l8Pl + 45P3PI + (477y32 - 36y8o^2 - 170y84^2 + l^Pl + Opf) Pi 
+3^3 [p26Pl — 42^0^2 + 6 P 4 P 2 — 3Pl + 2 PoPi) Pi 
+6 [Po — 3P2) P2 (~19^2 3-3P0P2 + 2P4P2 + 6Pl))y^ 

+ ((441^2 - 79Pi) Pi + P3 {-33Po - 8P2 + SSPi) Pi 

+ [648PI - l56PoPl - QOPiPl + 9PIP2 + 9PoPl) Pi + 36 {Po - 3P2) PIP3) y" 

+ 2^1 (39^f - 26P2PI + (167^1 - 18^4^2 + 15^2 - 3Po (^2 + A)) Pi - I2P0P2P3) y" 



+ Pi (36P2 + 9 ^ 1 ^ 3^2 + 3^0 (3^^^ — SPsPi — 4^2) Pi (112^2 “ 9^4)) y^ 
+ 2p^ (11^1 ~ ^PsPi + 12^2) y + 3 ^^ (^0 + ^2) > 
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(B. 8 ) 


where we have introduced two additional coefficients, he and hj, defined as 

he^ 3 m^a^{l+y^^[Pi +P 3 y^ + 2 P 2 y) [P^ + PsP^y^ + [ 3 Pi - P^Pz - 3 (^1 - 2^3) ^3) 

+ {3P1P2 + 12P3P2 — 3P0P3 — 2P1P4) + [ppl+PsPi — 3 {Po — 3P2) P2) y^+5^i^2y) I 

(B.9) 

, (^1 + y (2^2 + ^3y)) (3^oy ^ + y ^ (3^2y (y ^ - 5) + ^3 (3 - 9y2) - ^4y (y2 - 2)) + (6y ^ - 7y ^ - l) ) 

=-TV?- 

(B.IO) 


While this notation is inspired hy Refs. [144, 147], we have defined hi, 6,7 differently. 






c 

Transformation Properties of the 
Doubly-Coupled Bimetric Action 


Here we describe the transformation properties of the action (6.1) and how they determine the 
number of physically-relevant parameters for the doubly-coupled bigravity theory discussed 
in chapter 6. 


C. 1 Rescaling the Action 


Let us write the action as 


Ml r , _ Ml r , _ 

S = — —\ d^x/^d^i?(g)-—J 

+ + J det(geff,4*), (C.l) 


where 

4 

v[/r^-,pn) (C.2) 

n=0 
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is the usual dRGT interaction potential and satisfies 

V^-detgy = ^J-detfV • (C.3) 

Due to this property, the action is invariant under 

g/xv ^ *flivy Pn ^Pa—hj (C.4) 

since the effective metric 


S'ill = cPSiJ^v + 2ap g^oK + P^fiy^v 


(C.5) 


is also invariant under this transformation, as shown helow. Because the overall scaling of the 
action is unimportant, there is a related transformation which keeps the action invariant, hut 
only involves the ratio of Mg and M/, 


Mg Mf 



Mf „ Mg „ 

Ti J Ti J • 


M 


M 


f 


(C.6) 


These transformations reflect a duality of the action since they map one set of solutions, with 
a given set of parameters, to another set of solutions. 


Not all of the parameters Mg, Mf, a, p, and Pn are physically independent. In effect, 
we can rescale these parameters, together with g^v and f^v, to get rid of either Mg and M/ 
or a and p. In the end, only the ratio between Mg and Mf, or a and p, together with p„, 
are physically meaningful. The two parameter choices are physically equivalent and can be 
mapped to one another. We now describe the two scalings that give rise to the two parameter 
choices. 


Under the scalings 



C.l. Rescaling the Action 
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/’ 


2 2 2 

m a m y 


Pn 


Pn, 


(C.7) 


the effective metric becomes 




(C.8) 


while the action becomes 


S = 


Ml r , _ Ml r , _ 

—'^^^y-detgi?(g)- —I d^x^-det/B (/) 

+ m^M^^ f <i^xv/-cietg^^„e„ S~^f] 

J n=0 


+ d‘^X V^-dcigeffi^'m (geff, 4') • 

V 


(C.9) 


The effective metric is thus uniquely defined in this parameter framework, while the ratio 
between Mg and M/ is the free parameter (in addition to the Pn)- For this choice of scaling, 
the action is invariant under 


gjiv^ ^ffivy Pn ^ Pi-m 


(C.IO) 


or, more generally. 


Mg Mf 


'MfV ” . fMf 


T,. Pn^ Pi-n 


M 


Mp 


(C.ll) 


If, instead, we apply the scalings 


SlJ,v 


M2 

g 


SgiV 


// 


m ^ m 




fiv 




Me^ff . 
Mj 

Ml 

— ^cP, 

Min 


f -K jr \ 

/ M f \ 

^"^[Wg] 

Ml 

— -B^ 

' m2 ' ’ 


(C.12) 



262 


Appendix C. Transformation Properties of the Doubly-Coupled Bimetric Action 


then the effective metric is still of the form (C.5), while the action becomes 

S = -^J d^x yj- detgi? (g) - ^ J d^x^-det/i? (/) 

+ m^Ml^ [ d^x^J-detg'^Pne„ 

J n=0 

+ J d^x ^J-detg^n^m (geff, 4') • (C.13) 

For this choice of scaling, only the ratio between a and yS, together with the I5„, is physically 
important (the effective coupling can be absorbed in the normalisation of the matter 
content). Under this form, the action is invariant under 


g/iv 


•// 






A—nr 


a • 


>p. 


(C.14) 


To move from the framework with Mg and M/ to the one with a and P , one simply performs 
the rescaling 






/XV i 





SjJ-V ’’ ^ g/iVi 


m 




(C.15) 


Each of the parameter frameworks has its advantages. In the Mg and M/ framework, 
there is a unique effective metric, and it is the relative coupling strengths that determine the 
physics. In the a and P framework, we have one single gravitational coupling, M^^, and the 
singly-coupled limits are more apparent in the effective metric. Note that the ratio between a 
and P only appears in the matter sector, whereas in the Mg and M/ formulation their ratio 
appears in both the matter sector and interaction potential. 
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C.2 Symmetry of the Effective Metric 

In this section, we show that the effective metric is symmetric under the interchanges 



(C.16) 


In order to do this, we take advantage of the fact that g^a^y is symmetric, i.e., gX = X^g, as 
shown in Ref. [71] . We will find it useful to discuss the metrics in terms of their vielbeins, since 
we are dealing with square-root matrices and vielbeins are, in a sense, “square roots” of their 
respective metrics. We use Greek letters for spacetime indices and Latin letters for Lorentz 
indices. The g- and /-metric vielbeins are defined by 



(C.17) 


(C.18) 


while the inverse metrics are given by g'^^ = el and similarly for /z^^. The vielbeins of 
gf^'’ are inverses of the vielbeins for g^,,, e“el — 51 and = 5^, and again similarly for the 
f^y vielbeins. 

We will assume the symmetry condition (also called the Deser-Van Nieuwenhuizen gauge 
condition) 



(C.19) 


where Lorentz indices are raised and lowered with the Minkowski metric. It is likely, though it 
has not yet been proven, that this condition holds for all physically-relevant cases. In four 
dimensions, it holds when g~^f has a real square root (proven in Ref. [202], where it was 
conjectured that this result is valid also in higher dimensions). Assuming this condition, then 
it has been shown [83] that the square-root matrix is given by 
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The inverse of this is clearly 


a V ' 


since then 


a a b V a v v 


The form of the inverse then implies 




(C.21) 


(C.22) 


(C.23) 


which will be a useful property when showing the symmetry of the effective metric. We also 
have 

= ea^L‘1.. (C.24) 

In order to show that gX = X^g, we must thus have 

eai^Ll^eayL^. (C.25) 


Notice that this is not exactly the same as eq. (C.19), since in the first case we contract 
over spacetime indices, whereas here we contract over Lorentz indices. The two symmetry 
conditions are, however, equivalent, as discussed in detail in [203] . 


An alternative way of seeing that gX = X^g is as follows. Since 


f,aK = KLaae^.Ll = = /,„X“, 


(C.26) 


we have 

(C.27) 

But / = gX^, so equation (C.27) can also be written gX^ = X^gX^, which implies 


gX = X^g. 


(C.28) 
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Using this property it is straightforward to show that the effective metric is symmetric 
under the interchange of the two metrics. The effective metric we study was introduced in 
Ref. [73] in the form 

Sfv = a^giiv+2aPga(iXl.) + P^fi,v. (C.29) 

Due to the symmetry property (C.28), we can write this without the explicit symmetrisation, 

S'fy = +2al3gai,^" + l3^fi,v. (C.30) 

Suppose now that we do the transformation 

gfiv^fiiv, a^p. (C.31) 


The effective metric becomes 

sfv = a^g,iv+2aPf^aif f~^g)v+P^Uv. (C.32) 

This can he brought into the original form for g®® using the matrix property 

f[\/g~^f] ^gg~^f[fg~^f] ^gfg~^f- (C.33) 

Combining this with equation (C.23) we get 

ff f~^g^g\/g~^f- (C.34) 

Applying this to equation (C.32) we see that the effective metric is invariant under the duality 
transformation (C.31). This ensures that the entire Hassan-Rosen action treats the two metrics 
on entirely equal footing when matter couples to g®®. Note that this duality does not hold for 
the single-metric (dRGT) massive gravity as it is broken by the kinetic sector. 










D 

Einstein-Aether Cosmological Perturbation 

Equations in Real Space 


In this appendix, we present the real-space equations of motion for the linear cosmological 
perturbations in Einstein-aether theory coupled to a scalar field as described in chapter 8. 


We have for the v = 0 component of the aether field equation (2.77) 


— 6(ci 3-f -f 6 c2 ^^ ^ 

-|- [(2ci -|- C2)V\i + C3{V\i + B\i] + 3c2^'-\-3{2ci^ + C2)^^j 

- C3($-S- - + w;.) - C2{V‘'. +3^") + 






3m- fa' ■y\r, i \ 

- — Veee { —-2^^^ \ [3^'-3^^+V\i) 


2 a 
1 a 

^ 2 m 
1 - 


a 

'^-54)') 


'S<t>] 


(j)'i3'ij'-3M’^ + V\i) + 3\ --2^^ I 5(j) 


a 


= 0 , 


(D.l) 
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and the v = i component is 


a „ a a \ a 

2 —^ — 
m‘^ \ a \ a 


[Bi - M) 




a 


(ci + — 




+ ~i^C3 + — CiVij^ — ,ij 


a' 


ol 


3 I — - 2 J ^"2 I (5. _y.^_ 3^/^ + 33^^ . _ yj .. 


+ \v,« 

+ l — %U'{Bi-V,)- 5 <l>j)= 0 . 

2m ^ ^ 


The combined aether-scalar stress energy tensor (2.81) has perturbations 


5r% = 2 —{ - 3 (ci 3 + Cia-i [a[B^ - ^),;]' 

+ (ci3 -T3c2)^(l^\i 3-3^0 + j 

cx 

3 YTX ^ 3 YTX 

+ —^Vee [3^'- 3+ +- ^Vg^dcp, 


a 

n 

5 T \=2 — 


a 
fa" 

~2 (ci 3 +3c2)^^+ (3c23-C 3) I — 

V a 


{Vi-Bi) 


- CiU -cia 

+^(-Ci + C3) [(B; - - Vn,ij] I 


m" 




3m^- fa 




■Vee 


a 


m - 


2^^ m- Bi) + — Vg^(p'iVi- Bi), 


a 


5 T'j — 2 —- -j (C133-3C2) 


a 


+ (X 


y I ^ 

- 2 I — 
a 


^5‘j-{cu+3c2)^^'5^j 


a‘^{^C2V^,k5^j + {c\3 + 3c 2)'^'5^j 3- —CulVfj + Vj'‘ 3-2/?"'.) 


3-{-^ + a^V^5(j)^ 




3- 7 Vee 




(a^ ( 3 ^' + V\k))' 


+ ^Veee{^^-2M’^]{3^'-3J^^ + V\k) 


(D.2) 


(D.3) 


(D.4) 
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+ — [Ve^{3J^5(P +5(P'- ^'^] + Ve^^^'5(P] 

d 

- f f a" , 

+ -^14„ \ 3 \-- 2 M ’ 


5(j) + 



(D.5) 


We can do a consistency check by choosing V[0,(p) — \p6'^ + V[(p). This corresponds 
to pure ae-theory, with C 2 rescaled to C 2 + yS, and a scalar field coupled only to gravity The 
cosmological perturbations in that model are presented in [127]. Our equations agree with 
the literature in this limit, as we would expect. 
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